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POINTWISE ESTIMATES AND REGULARITY IN GEOMETRIC OPTICS 
AND OTHER GENERATED JACOBIAN EQUATIONS 

NESTOR GUILLEN AND JUN KITAGAWA 


Abstract. The study of reflector surfaces in geometric optics necessitates the analysis of cer¬ 
tain nonlinear equations of Monge-Ampere type known as generated Jacobian equations. This 
class of equations, whose general existence theory has been recently developed by Trudinger, 
goes beyond the framework of optimal transport. We obtain pointwise estimates for weak so¬ 
lutions of such equations under minimal structural and regularity assumptions, covering situa¬ 
tions analogous to that of costs satisfying the A3-weak condition introduced by Ma, Trudinger 
and Wang in optimal transport. These estimates are used to develop a C 1,a regularity theory 
for weak solutions of Aleksandrov type. The results are new even for all known near-field reflec¬ 
tor/refractor models, including the point source and parallel beam reflectors and are applicable 
to problems in other areas of geometry, such as the generalized Minkowski problem. 
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1. Introduction 

1.1. Overview. This paper is concerned with the regularity theory of a broad class of Monge- 
Ampere type equations spanning optimal transport and geometric optics. These may sometimes 
he outside the scope of optimal transport but always have a Jacobian structure, namely 

det (D[T(x, Du,u)]) = ip(x, Du, u), (1-1) 
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for some T : dom(T ) C Q x xK-> (see below). Admissible u are “convex”, i.e. 

D[T(x, Du, u)] > 0, 

which is necessary for (1.1) to be a degenerate elliptic PDE. To appreciate the generality of 
(1.1), note it covers the real Monge-Ampere equation, the c-Monge-Ampere equation from 
optimal transport with cost c, the point source near-field reflector problem from geometric 
optics, several variations of the Minkowski problem and principal-agent problems in economics 
when dealing with a non-quasilinear utility function (references to the relevant literature are 
given below, see also Section 3). Some of the corresponding T’s are 

T(x,p,u ) = p (real Monge-Ampere equation), 

(. D x c)(x,T(x,p,u)) = —p (Optimal transport with cost c(-, •)), 

- P 

T(x,p,u ) = 7—75 - 7 --— 777 (Point source near-field reflector). 

| p\ z — [u-p- x) z 

These and other examples will be discussed further in Section 3. The mappings T(x,p,u ) 
considered here will always be given by a generating function. This means there is a function 
G : dom(G ) CMxMxM—>R and associated “exponential mappings” exp® n (-), Z x (-, •) (see 
Section 4 for definitions) such that 

T(x,p,u) = expg tt (p). 

For such T’s, (1.1) takes the form 

det(D 2 u + (D 2 G)(x, exp xu (Du), Z x (Du, u))) = Du,u). (GJE) 

The corresponding convexity condition for u asks that it be of the form 

u(x) = sup G(x, X, Zx) 

X 

for some function x —> z x E M, x E M. Following work of Trudinger [Trul4b], where the 
general framework for these equations is proposed, equation (GJE) will be called a “Generated 
Jacobian Equation.” The distinguishing feature of (GJE) is the dependence of the mapping T 
on the values of the solution, which is not present in the case of optimal transport. Recall that 
in optimal transport, one has 

G(x, x, z) = —c(x, x) + z, 

T(x,p, u) = T(x,p) = exp c x (p) 

where c(x, x) denotes the cost function. In general, changing the “height” parameter z in 
G(x,x,z ) will result in a change in the shape of the function, and not merely a vertical shift; 
and the choice of coordinate systems must now take this into account. 

The aim of this work is to determine the differentiability of weak solutions to (GJE) under 
minimal on assumptions on the data (including the generating function G, and the function 
ifc)- Specifically, we focus on weak solutions to (GJE) in the “Aleksandrov sense”, we also 
require the right hand side of the equation to bounded away from zero and infinity. The notion 
of Aleksandrov solution originated in the study of the real Monge-Ampere equation and has also 
played a key role in optimal transport, see Definition 4.16 for the setting of generated Jacobian 
equations. Our results are new even for the case of near-field reflector/refractor problems, 
covering situations where the condition (G3s) fails but (G3w) still holds. The (G3w) condition 
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was introduced by Trudinger in [Trul4b], it generalizes (A3w) condition for the Ma-Trudinger- 
Wang tensor [MTW05]. Both the MTW tensor and the (A3w) condition play a central role in 
the regularity theory of optimal transport. 

This general framework makes our results applicable to problems beyond geometric optics. 
Roughly speaking, these results are in the same vein as Caffarelli’s localization and differen¬ 
tiability estimate for the real Monge-Ampere equation [Caf90b]; Figalli, Kim, and McCann’s 
regularity theory for optimal transport maps under the (A3w) condition [FKM13a], as well as 
work by Vetois [Vetl5], and by the authors on the strict c-convexity of c-convex potentials 
[GK15]. 

In the spirit of [GK15], the most important assumption on G is a synthetic version of (G3w), 
which is roughly a “quantitative quasiconvexity” condition along G-segments (G-QQConv). 
This condition follows from (G3w) when G is smooth enough, it generalizes the (QQConv) 
condition introduced in [GK15] for optimal transport (and in that case, it refines Loeper’s 
maximum principle). 

Our main results can be broadly separated in two parts. The first part consists of pointwise 
inequalities, Theorems 2.1 and 2.2, for G-convex functions u (see Definition 4.14). These are 
obtained under natural assumptions on G and u, one of the key conditions being (G-QQConv) 
mentioned above. The pointwise inequalities may be thought of as nonlinear analogues of the 
Blaschke-Santalo inequalities for the Mahler volume (see discussion in Section 1.2) 

The second part comprises Theorems 2.3 and 2.4, in which we prove strict G-convexity and 
interior G 1,a differentiability respectively of weak solutions u of (1.1). This part relies on the 
pointwise inequalities in Theorems 2.1 and 2.2 to show solutions satisfy a localization prop¬ 
erty (which leads to strict convexity) and an engulfing property (which leads to interior C 1,a 
estimates). Finally, we show that for G smooth enough, condition (G-QQConv) is implied by 
(G3w) (Theorem 2.5). Precise statements for these results are given in Section 2. 

Acknowledgements. The authors would like to thank Neil Trudinger for helpful correspon¬ 
dence and also for graciously sharing some of his unpublished work. The authors would also 
like to express their gratitude to the anonymous referee for their prompt and thorough reading 
of the manuscript, as well as suggested improvements. 

1.2. Strategy: Mahler volume and Monge-Ampere equations. In order to motivate the 
main results (Section 2) it will be convenient to recall several facts about the Mahler volume 
and relate it to the regularity theory for the real Monge-Ampere equation. Let S' be a convex 
set with non-empty interior and whose center of mass is 0. The Mahler volume of S, m(S), is 
defined as 


m(S) :=|S| £ |S*| £ , 

where the set S* is the polar dual of S, 

S* := {y G M n | (s', y) < 1, V x' e S}. 

Then, the celebrated Blashke-Santalo and reverse Santalo inequalities together say that 

Cn 1 <\S\ c \S*\ c <c n . (1.2) 

These geometric inequalities imply (and are in fact equivalent) to certain pointwise inequalities 
for convex functions. Suppose a convex function u : M n —> M and an affine function l are such 
that the set S := {u < 1} is nonempty, bounded, and with center of mass at 0. Then, one can 
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use (1.2) to prove the bounds 

sup (l - u) n > C~ l |S| £ |3u(±S)| £ , (1.3) 

{l(x)-u(x)) n <C n \S\ c \du(S)\ c , Vx£S. (1.4) 

These two estimates are crucial in the theory of the Monge-Ampere equation, in fact, they 
are the basis of Caffarelli’s theorem on the strict convexity and differentiability of Aleksandrov 
solutions of the real Monge-Ampere equation [Caf90b] (see the discussion in [Caf92a, Part 
2], and the discussion in [GK15, Section 1.3]). For now let us explain informally how these 
estimates may be used to obtain regularity to solutions of the Monge-Ampere equation, namely 
C 1,1 regularity and strong convexity. Note first that a convex function u is C 1,1 and strongly 
convex if and only if there is a C > 0 such that for any supporting affine function Iq and every 
small enough h > 0 we have the inclusions 


^c-iy/h C {u < /o + h} C (1.5) 

In other words, the level sets of u are comparable to those of a paraboloid. Now, let u be an 
Aleksandrov solution to det(B 2 u) = /, with A < / < A (see Definition 4.16). Let us see to 
what extent would something like (1.5) hold for u. Since u is an Aleksandrov solution, we have 
\du(S)\ c in which case estimates (1.3)-(1.4) imply 


B c~ 1 Vh 


< \{u <lo + h}\ c < 


B 


CVh 


( 1 . 6 ) 


for some C > 0. This is a weaker assertion than (1.5), since we only compare the measures of the 
sublevel sets. The approach introduced by Caffarelli in the context of the real Monge-Ampere 
equation [Caf90b, Caf91, Caf92b] shows how to go beyond (1.6) and obtain C 1,ot regularity and 
strict convexity for u, and even (1.5) and higher regularity if / is assumed to be regular. See 
Gutierrez’s book [GutOl] for a comprehensive exposition of these ideas. 

For the c-Monge-Ampere equation arising in optimal transport, Figalli, Kim, and McCann 
obtained [FKM13a] analogues of (1.3)-(1.4) under the (A3w) assumption of Ma, Trudinger 
and Wang, from where they obtained C 1,a and strict c-convexity estimates. In [GK15] the 
authors introduced a condition on costs, “quantitative quasiconvexity” (QQConv), and used 
it to derive analogues of (1.3)-(1.4). This (QQConv) condition is a refinement of Loeper’s 
“maximum principle” [Loe09] but at least for C 4 costs turns out to be equivalent to (A3w) 
(and thus to Loeper’s condition itself). 

Beyond C 1,a and C 2,a estimates, these inequalities are also an important tool in deriving 
W 2,p estimates [Caf90a] under extra assumptions on /, and more recently W 2,1+e estimates 
under minimal assumptions [DPF13, DPFS13]. See the survey by Figalli and De Philippis 
[DPF14] for a thorough discussion of recent optimal transport literature (see also Section 3). 

1.3. Notation. Before continuing with the introduction, let us set up some notational con¬ 
ventions used within the paper: (•, •) will denote the evaluation pairing between an element 
of a vector space and an element of its dual space. ( M,g ), ( M,g ) will denote fr-dimensional 
complete Riemannian manifolds. Points in M will be denoted with x,y... points in M will be 
denoted with x,y,... while |-|£ will denote the Riemannian volume on ( M,g ), ( M,g ), or the 
associated Riemannian volumes on a tangent or cotangent space. and | • | g x will denote 

the length of tangent or cotangent vectors, with respect to the inner products g x and g x , and 
d g (-, •), dg(-, •) will refer to the geodesic distances induced by the respective metrics. Also, we 
will use A mt , A cl , and AP to refer to the interior, closure, and boundary of a set A respectively. 
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Here is a summary of several other symbols, together with their definition number. 


Notation / Condition 

Name 

Definition location 

G, H 

Generating function, dual function 

Section 4.1 

05 b 


Section 4.1 

(Unif), (Lip Ao ), K 0 


Definition 4.1 

(G-Twist), (G*-Twist) 


Definition 4.3 

(G-Nondeg) 


Definition 4.5 

E{x, x , z), E(x, x, z) 


Definition 4.5 

Px,z (x), Px,u(x) 


Definition 4.6 

[^4]ai,2: 


Definition 4.6 

[xo, Xl\x, z 

G-segment 

Remark 4.9 

ex P£;*(■), exp£ u (-), ZC(.,.) 

G-exponential mappings 

Definition 4.7 

(DomConv*), (DomConv) 


Definition 4.11 

(G-QQConv), (G*-QQConv) 


Definition 4.13 

m,m 0 , • • • 

G-affine functions 

Definition 4.14 

u,u 0 ,... 

G-convex functions 

Definition 4.14 

d G u 

G-subdifferential 

Definition 4.15 


nice, very nice functions 

Definition 4.18 


very nice constant 

Remark 4.29 

A* 

Polar dual 

Definition 6.2 

a g 

arm, A 

G-dual 

Definition 4.22 

<s(0 

G-cone 

Definition 4.25 

U A 

Supporting hyperplane 

Definition 5.1 


2. Statement of main results 

In this section we state the exact form of our main results. The precise statement themselves 
involve a great deal of notation that will not be introduced until Section 4, however, for the sake 
of having all the main results stated in one section, we choose to present them here. Thus, the 
reader is advised to skim through this section on a first reading and return to it after reading 
the elements of generating functions in Section 4. 

Structural assumptions. All of the theorems below require a number of structural as¬ 
sumptions on G and its domain of definition. In many important subclasses of examples (i.e. 
optimal transport, near field problems in optics) each of these structural assumptions are known 
to be necessary conditions for the regularity of solutions. 

Then, we are given n-dimensional Riemannian manifolds M, M. a generating function which 
is a function G : MxMxK —> M; we are also given doma in s D C M, Cl C M, and gCMxMxR. 
We assume these objects have the following properties (see Section 4 for details) 

(I) G(x, x, z ) is C 2 in the sense that all purely mixed second derivatives exist and are 
continuous in all of M x M x R. Moreover, G z < 0. 

(II) There are constants —oo < u < u < oo and Kq > 0 such that Cl, Cl, and g satisfy 
(Unif), (Lip A() ), (DomConv), and (DomConv*) with respect to the interval (u,u). 


(Ill) G satisfies (G-Twist), (G*-Twist), (G-Nondeg), (G-QQConv) and (G*-QQConv). 
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We are also given a function u (eventually, the solution to (GJE)), assumed to satisfy the 
following (see Definition 4.18 and Remarks 4.20 and 4.29) 

(IV) u : D —> M is a very nice G- convex function with an associated 
very nice interval [u n ,un]- 

Remark. The notion of very nice for a G-convex function is explained in Definition 4.18, this 
notion is irrelevant in optimal transport, where all G-convex functions are automatically very 
nice. The necessity for this notion for general Generated Jacobian equations is illustrated by 
phenomena present in the near field problem (see Karakhanyan and Wang [KW10, Theorem 
A,B]). This is discussed in detail at the end of Section 3.1. 

Finally, in all what follows M > 1 will denote the constant associated to G by (G-QQConv) 
and (G*-QQConv) with the interval [« n ,«n]- 

The first result is an Aleksandrov type estimate, which will play the role that (1.4) plays for 
the standard Monge-Ampere theory. 

Theorem 2.1. Suppose m(-) := G(-,x,z) for some ( x,z ) G fi x M is a nice G-affine function 
and S := {u < m}. Also assume that [S]^ C B C 3 B C [D] s>2 for some ball B in T£M 
(which may be of any radius). Then, there exist very nice constants e, G > 0 such that for any 
uq E § n-1 C TIM and xq E 5 int , if diam(5) < e then 

cd(po, ns,. ) 

(m(x o) - ti(* 0 )) B < ) ^ \S\ C \d G u(S)\ c , 

X,Zl ) 

where po := Px, z {xo) and l([S]x, Z i &q) is defined as the maximum length among all line segments 
parallel to u 1 and contained in [5]^ iZ . 

The second result gives a generalization of estimate (1.3). 

Theorem 2.2. Suppose m(-) := G(-,x,z ) for some (x, z) G Cl X M is a G-affine function such 
that u N < m < un on D cl . Writing S := {x E D | u{x) < m(x)}, there exist very nice constants 
G, K > 0 such that for any A cSl with [A]x,z connected, satisfying 

KM[A] StZ C [S]x, z , (2.1) 

sup m + sup (m — u) < u, (2-2) 

A A 

we have 

sup (m — u) n > G \A\ C | d G u (A )\ c . 

A 

Here KM[A]x, z is the dilation of [A]^ with respect to its center of mass Px,z(x cm ). 

Our next two results concern weak solutions u to (GJE), in the sense of Aleksandrov (see 
Definition 4.16). We use the notation Do for the support of the Radon measure \d G u(-)\ c and 
:= d G u (Do). The first of the two theorems deals with the strict G-convexity of u. 

Theorem 2.3. Suppose u is a very nice Aleksandrov solution of (GJE). If Dq 1 C D mt and 
Dq 1 C D mt , and [Do] XOiU ( Xo ) is convex for some xq E Dq 1 *, then u is strictly G-convex at xq , i.e. 
if xq E d G u(x o), then the set {x E D | u(x) = G(x, xq, H(xq, xq, u(xq))} is the singleton {xq}. 
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We prove (interior) C 1,a regularity of weak solutions (provided they are very nice). The proof 
relies on the previous theorems as well as extensions of the engulfing property of sublevelsets of 
solutions for the real Monge-Ampere Equation (see [FM04],[FKM13a, Section 9]). 

Theorem 2.4. Suppose in addition to the assumptions of Theorem 2.3 above, that G is a C 1,a 
function in the x variable for some a G (0,1], uniformly in the (x,z) variables. Then there 
exists an (3 G (0,1] such that u G (^o nt )- 

Our final result connects the (G3w) condition introduced by Trudinger [Trul4b] with the 
conditions (G-QQConv) and (G*-QQConv). 

Theorem 2.5. Assume there are —oo < u < u < oo such that Cl, Cl, and g satisfy (Unif), 
(DomConv), and (DomConv*) with respect to (u,u). Also assume G is C , by which we mean 
all derivatives of up to order 4 total, with at most two derivatives ever falling on one variable 
x, x, or z at once, exist and are continuous and G satisfies (G-Twist), (G*-Twist), (G-Nondeg), 
and (G3w). Then G also satisfies both (G-QQConv) and (G*-QQConv). 

2.1. Overview of the rest of the paper. A detailed discussion of examples of (GJE) covered 
by our results is carried out in Section 3, examples discussed include the near-field reflector 
problem and the generalized Minkowski problem. In Section 4 we review the elements of 
generating functions and the associated Jacobian equations (GJE) (following to a great extent 
the ideas in [Trul4b]), we also introduce the (G-QQConv) and (G*-QQConv) conditions on G. 

In Section 5 we show how (G-QQConv) and (G*-QQConv) lead to the Aleksandrov-type 
estimate, Theorem 2.1. In Section 6 we prove the sharp growth estimate, Theorem 2.2. In 
Section 7 we use the pointwise estimates to prove a localization property for weak solutions, 
and their strict convexity (Theorem 2.3). The work of all previous sections are combined in 
Section 8 to prove solutions are G 1,a (Theorem 2.4). 

Finally, in Section 9 we prove (Theorem 2.5) that the condition (G3w) (defined by Trudinger 
in [Trul4b] to obtain classical regularity in generated Jacobian equations), implies conditions 
(G-QQConv) and (G*-QQConv). 


3. Examples 

3.1. Point source, near-field reflector. For our first example, we spend some time discussing 
the near-field reflector problem, as it is a well-studied problem that gives rise to a generated 
Jacobian equation (GJE) which does not arise from an optimal transport problem, and as such 
displays many subtle difficulties not seen in the optimal transport case. 

The engineering literature on reflector design is too large to review in detail here, but let us 
point out the reader to a few references, such as Oliker [01i89] Kochengin and Oliver [K098] 
and Janssen and Maes [JM92] for the case of cylindrical reflectors. For more on the literature 
and the exposition to follow, the reader is directed to the survey article [Oli03] by Oliker, the 
discussion in Karakhanyan and Wang [Karl4], See also the classical monograph by Rusch and 
Potter [RP70] for a broader introduction to the engineering of antennas. 
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Figure 1. 


We are given a light source at some point O € R 3 , that shines through a “source region” 
H C § 2 and a “target region” to be illuminated, which is a region fi contained within some 
codimension one surface M Cl 3 . Moreover, the light source may not have a uniform intensity, 
instead it radiates energy through D modeled by some absolutely continuous measure /dVol§ 2 . 

The goal is now to build a reflector: a (perfectly reflective) surface C R 3 given by the radial 
graph of some function p : D —> R with the property that light emanating from O according 
to the distribution / is reflected off to arrive in Cl. This problem is severely underdetermined, 
thus we also assume that we are given an absolutely continuous measure gdV oljg supported on 
D, and the reflector is required to recreate this measure as the resulting illumination pattern. 
The assumption of perfect reflection implies that the total masses of / and g must be equal. 
The usual plan of attack for this problem is to first assume the geometric optics approximation , 
in which light rays are treated like particles, completely ignoring any wave-like behavior that 
may be present. 

To motivate an elementary method of constructing such a desired reflector, consider the case 
where the target measure is not absolutely continuous, but a Dirac delta concentrated at a 
point ieO. Then the reflector can be taken as any ellipsoid of revolution with foci O and x. 
For 2 a > |a;| there is a unique ellipsoid of revolution with foci O and x whose major axis has 
length equal to 2a. A straightforward computation shows that such an ellipsoid can be written 
as the radial graph of a function e(-,x, a) : § 2 —> R + defined by 


e(x, x, a) 


1|~|2 


a ~ 4IT 
a — \ (x, x) 


where (x,x) is the Euclidean inner product in R 3 . We can view a here as a scalar parameter 
controlling the eccentricity of the ellipse, in particular we see there is a one parameter family 
of reflectors that solve our problem. 
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£ = {xe(x,x, z) | x G § 2 } 



Figure 2. 

If the target measure is now a finite sum of weighted Dirac deltas, we can take the reflector 
to be the boundary of the intersection of the same number of ellipsoids, each with one focus 
at O and the other at a point where the sum of deltas is supported. By adjusting the scalar 
parameters, we can ensure each point in the target receives the correct amount of energy. 

One can then approximate the absolutely continuous target measure by a sequence of such 
finite sums of Dirac deltas, and rigorously justify a limiting process to obtain a reflector that 
is the boundary of an intersection of (an infinite) family of ellipsoids, or in terms of p: 

p(x) = inf e(x,x,a) (3.1) 

(x,a)&A 

for some appropriate collection A. 



Figure 3. 

This representation of p can be interpreted as a form of “concavity” of p , where instead of 
hyperplanes as in the usual case of a concave function, p is supported from above by graphs of 
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ellipsoids which serve as some sort of “fundamental shape”. Indeed, if we take 


G(x, x , z) 


1 

e(x, x, z -1 ) ’ 


dehned in 


0 = { (x, x, z) € S 2 x M x R + | \z\x\ < l} , 

then p will exactly be a G-convex function as in Definition 4.14. 

When M can be written as the graph of a function over a portion of R 2 , it can easily 
be verified that our choice of G coincides with that of Trudinger in [Trul4b, (4.15)]. In the 
particular case when M is contained in a hyperplane parallel to R 2 lying below R 2 , from the 
formulae in [Trul4b, Section 4] it can be seen that G satisfies conditions (G-Twist), (G*-Twist), 
(G-Nondeg), and (G3w), and (Unif) with (u,u) = (0, oo). The main difference here from the 
usual case of convexity / concavity (or indeed, from the optimal transport case known in the 
literature as c-convexity ), is that when the scalar parameter z is changed in any of the functions 
e forming the infimum in (3.1), there is a nontrivial change in the shape that goes beyond a 
simple translation or dilation. 

Next one can consider what is known as the ray-tracing map , a map T p : D — > Cl that simply 
gives the location that a beam originating through x ends up after reflecting off of Tp. 

It can be seen that to obtain the desired illumination property, it is sufficient to impose a 
prescribed Jacobian equation of the form f (x)detDT p (x) = g{T p {x)). From the form (3.1) of 
p, and a calculation of T p in terms of the derivative of p , this equation can be re-written as a 
generated Jacobian equation of the form (GJE). In fact, the choice of u = p~ 1 will be a solution 
of (GJE) with our above choice of G, and a certain i/jq involving the densities / and g. 



Figure 4. 

It should be noted that a question of deep physical interest now is regularity of the reflector. 
Indeed, non differentiability of a reflector would cause diffraction phenomena, which may not 
be accurately modeled by the geometric optics approximation. In the case of refraction prob¬ 
lems which also give rise to generated Jacobian equations, singularities can lead to chromatic 
aberrations, which also lie outside the realm of geometric optics. 
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Recent work of Karakhanyan and Wang [KW10] guarantees regularity ( C 2,a ) for reflectors. 
Their main result illustrates some of the complexities that arise once we leave the optimal 
transport framework (see in particular Remark 3.1 below). 

Theorem. [KW10, see Theorems A-B] Suppose that 

(1) hi, V C 8 n_1 , n V = 0, has Lipschitz boundary. 

(2) D is a region in a convex hypersurface M, given by a radial graph of some smooth 
function over V. 

(3) f : 12 — y M, q : Cl —> M smooth, strictly positive functions with the same total mass. 

(4) dQ is “R- convex.” 

Then, there is a reflector that is contained in a region close to O that is smooth. 

The authors continue on to give finer conditions to obtain regularity (see [KW10, Theorem 
C]). In particular, they provide a condition on the second fundamental form of the target 
hypersurface M corresponding to the (G3s) condition; they demonstrate regularity under this 
condition, and that if a version of the condition corresponding to (G3w) fails then there are 
smooth, positive / and g for which the reflector is not even C 1 . 

Remark 3.1. Another important difference with the regularity theory of optimal transport 
is that two solutions for the same data / and g may exhibit different regularity. In fact, 
the existence of such examples can be proven, see the discussion on page 567 of [KW10]. 
This difficulty is what requires us to have to consider the notion of very nice solutions, see 
Definition 4.18 and the remarks that follow it. 

We point out that our method of proof is entirely different from those of [KW10], as their 
method relies on uniform a priori estimates, while in this paper we rely on pointwise estimates 
of the solution. In particular, we are able to handle the borderline case corresponding to the 
(G3w) condition. However, it should also be noted that the results of [KW10] (as those of 
[GT14], see below) are finer than ours in the sense that they are “local” in nature: their result 
can characterize and separate regions of regularity and nonregularity of solutions, while ours 
are “global”: we can only find a solution to be regular on its whole domain, or not. 


3.2. Other geometric optics problems. There are a number of other geometric optics prob¬ 
lems that also result in generated Jacobian equations of the form (GJE), which do not fall within 
the optimal transport problem. Some of this we mention briefly (even though they each de¬ 
serve as lengthy a discussion as the previous). One can, for example, consider problems of 
refraction instead of reflection with a point light source, as considered in works by Gutierrez 
and Huang [GH14] ; and Oliker, Rubinstein, and Wolansky [ORW15]. In another direction, one 
can change the light source to be a parallel beam instead of a point source (see [Karl4]), or 
consider multiple optical instruments instead of just one (see work of Glimm and Oliker [GO04], 
and Oliker [Olill]). Another interesting family of problems are models with nonperfect energy 
transmission, as studied by Gutierrez and Mawi [GM13] and Gutierrez and Sabra [GS14]. 

There are regularity results available for several of these problems, under assumption (G3s). 
We highlight recent work of Gutierrez and Tournier [GT14] dealing with the (near field) parallel 
beam reflection and refraction problems. Their results include C 1,a estimates without any 
smoothness assumptions on the source and target measures. Moreover, unlike our results, the 
results in [GT14] only require local assumptions regarding the “niceness” of the solutions. 
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To give a concrete example, let us write down the generating function for the parallel beam, 
near-field reflector problem. Let <J> be a smooth function on some compact region of M 2 (whose 
graph represents the target surface to be illuminated) and for (®, x, z) E M 2 x M 2 x M + let 


G(x, x , z) 


1 

2 



— z\x — x\ 2 


+ $(x). 


Then a solution of (GJE) with this G will solve the reflector problem, with an appropriate 
choice of right hand side depending on the input and output light patterns. There is a detailed 
verification of conditions (Unif), (G-Twist), (G*-Twist), (G-Nondeg), and (G3w) contained in 
[JT14, Section 4.2] for this choice of G, with (u, u) = (0, oo). 


3.3. Optimal transport. Fix any two domains Cl C M and Cl C M in Riemannian manifolds, 
suppose we have a measurable cost function c : fl cl x fl cl —> M, and probability measures /r and 
v with supports in Cl and Cl respectively. The optimal transport (Monge-Kantorovich) problem 
is to find a measurable mapping T : spt p —> spt u defined fi-a.e. with TU/i = u. minimizing 

/ c(x, S(x))n(dx) 

Jn 

over all measurable S : spt (jl — > spt v with S#p = v. 

The connection of the optimal transport problem with generated Jacobian equations is 
through defining 

G(x, x, z) := —c(x, x) — z. 

With this definition, various structural conditions reduce to well-known conditions, for example 
in the notation of [GK15, Section 2]: (G-Twist) and (G*-Twist) to (Twist), (G-Nondeg) to 
(Nondeg), (DomConv) and (DornConv*) to (DomConv) there, (G-QQConv) and (G*-QQConv) 
to (QQConv), and (G3w) and (G3*w) to (A3w) (also known as the Ma-Trudinger-Wang or 
(MTW) condition). If (Twist), (Nondeg), and (A3w) hold on fl cl x Q cl , note that 0 = fl = 
n cl x n cl x M hence (Unif) is satisfied with (u, u) = M. Also with these conditions, if p, 
v <C d\o\M it is known that a solution of the optimal transport problem can be obtained from 
a G-convex potential function u satisfying (GJE), by the expression T(x) := exp ^ (Du(x)), 

for any choice of uq E M (see [Bre91, GM96, McCOl, MTW05]). There is also a regularity theory 
based on conditions (A3w) and (QQConv), see Section 9 for more comments and references. 

We also point out a connection of optimal transport to the near-field reflector example in 
Section 3.1. If the target surface M is very far from the source O, then any point being 
illuminated is approximately determined by the direction of the beam after reflection. Relatedly, 
if the focus x is far away, then the corresponding ellipsoids are close to being a paraboloid of 
revolution. Thus taking a limit as the target object goes out to infinity, one obtains the far-field 
reflector problem , which can be viewed as a problem where both domains are the sphere, and 
reflectors are constructed as envelopes of paraboloids of revolution. Mathematical study of the 
far-held rehector problem itself stretches back several decades ([HK85, CO08, CGH08]). The 
realization that this problem was equivalent to an optimal transport problem for the cost 

c(x,x) := log (1 - (x,x)) 

on § 2 x § 2 (see Glinun and Oliker ([GO03]), X.-J. Wang ([Wan96, Wan04]), Guan and Wang 
([GW98])) was very fruitful and served as motivation much work in both the mathematics of 
rehectors and optimal transport. 
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3.4. Generalized Minkowski problem. A different kind of generated Jacobian equation is 
given by the classical Minkowski problem. Recall that given a convex body B C W 1 , O e B, 
its supporting function is a function h : S n_1 —> M defined by 

h{x) = sup(g, x), x £ § n_1 . 

q&B 

It is well known that if K(x) denotes the Gauss curvature of the boundary of B at the point 
with outer normal x, then (see [L095]) 

det (V? )h + h gij ) = 

where (ji 3 denotes the standard metric of § n_1 and denotes the respective covariant deriv¬ 
ative. The classical Minkowski problem consists in recovering B from K(x): given a function 
K(x) on the sphere satisfying certain compatibility conditions, does there exists a smooth, 
strongly convex body B whose Gauss curvature at the point with normal x is equal to K{x)l 
The formula above shows that in terms of the support function of B, this problem falls within 
the scope of equation (GJE). 

Motivated by questions stemming from the Brunn-Minkowski theory of mixed volumes, Lut- 
wak and Oliker [L095] considered the more general p-Minkowski problem (p > 1) which asks 
to find, for a given function K : — y M, a convex set whose support function h solves 

det (V%h + h 9ij ) = hP ~ l dC ^y 

For p > 1, and K(x) a positive, even function. When p = 1 this gives back the original 

Minkowski problem. 

Let us make a few comments about the validity of the various structural assumptions for this 
example when p = 1 (see Section 4 for definitions). First, the generating function is given by 

G(x, x, z ) = z(x, x), 


where 


0 = {(m, x, z) G § n_1 x S n_1 xR| (s,i)>0, z > 0}. 

Then, a straightforward computation shows that 

DG(x, x, z) = z(x — (x, x)x), DG(x, x, z) = z{x — (x, x)x), G z (x, x, z) = (x, x). 

From here it is not difficult to check the injectivity of ( DG(x , x, z), G(x, x, z)) as a function of 
(x, z) (for any fixed x) as well as the injectivity of 

DG(x,x,z) z 

-7T7—=—r =- -K x - (* ' ®)z), 

G z (x, x,z) x ■ x 

as a function of x (for any fixed (x,z)), therefore G verifies (G-Twist) and (G*-Twist). It 
is not hard to see that the above maps are local diffeomorphisms, and thus the condition 
(G-Nondeg) also holds (see also Remark 4.10). The validity of condition (G-QQConv) remains 
to be determined for this particular G. 
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3.5. Stable matching problems with non-quasilinear utility functions. Finally, it is 
worthwhile to point out a recent preprint of Noldeke and Samuelson where a Generated Jacobian 
Equation arises in economics. In [NS13], the authors consider stable matching problems and 
principal-agent problems where agents may have utility functions that are not quasilinear. More 
concretely, in this setup, x G X represents all possible buyer types while y G Y represents all 
possible seller types, and v G M is a monetary transfer (i.e. the price of a product). One is 
given utility functions cj)(x, y, v ) and ip(x, y, u), <f(x, y, v ) being the intrinsic value that x receives 
when purchasing from seller y at a price of v, while ip(x,y,u) represents the utility y obtains 
when making a transaction with x, by providing x with a utility of u. Naturally, these functions 
satisfy the inverse relation fi{x,y,if{x,y,u)) = u. 

The stable matching problem is then as follows: given two probability measures y and v on 
X and Y, find a pair of utility profiles (u, v ) which are measurable, real valued functions on X 
and Y, and a bijective, measurable matching y : X Y such that 

u(x) = 4>(x,y(x),v(y(x))), 

v{y) = ^{y~ 1 (y),y,u{y~ 1 (y))), 

y#y = g 

where it is asked that (u, v, y) be stable, meaning that 

u(x) > fi(x,y,v(y)), v(y) > ip(x,y,u( z)),V x 6 X, y eY. 

In other words, each buyer and seller gets the most utility out of the particular matching y, 
and so they have no incentive to pick different parties to deal with. Thus for a stable matching, 
the profile u is a ^-convex function, satisfying some weak version of equation (GJE) (with right 
hand side depending on the measures y and v). 

A utility function is said to be quasilinear if it has the form 4>(x, y, v ) = b(x, y) — v. In this 
case, the stable matching problem reduces to an optimal transport problem, and is also related 
to a hedonic pricing problem. This direction has been explored by Ekeland ([Eke05, EkelO]), 
and later by Chiappori, McCann, and Nesheim ([CMN10]). Figalli, Kim, and McCann have 
also shown that the problem becomes a convex screening problem, under a strengthening of 
the (A3w) condition, often known as “non-negative cross curvature” in the optimal transport 
literature (see [FKM11]). Moreover, for this quasilinear case, the structural assumptions for G 
discussed in Section 4 reduce to the standard ones for optimal transport (see Example 3.3). 

It is worth noting that in the terminology introduced at the beginning of Section 4, the 
function (f corresponds to G, while if corresponds to the dual generating function H. We do 
not know yet of any specific, multi-dinrensional non-quasilinear utility functions for which our 
assumptions hold. It would be worthwhile to find concrete examples of such utility functions 
which are different from the generating functions in the previous examples, and to provide 
economic interpretations for these structural assumptions. 

4. Elements of Generating Functions 

4.1. Basic definitions. Suppose (. M,g ) and ( M,g ) are n-dimensional Rienrannian manifolds. 
We fix a real valued generating function G(-, •, ■) defined on M X M x I for some open interval /; 
after a change of variables that will not affect any of the other conditions we pose on G, it can 
be assumed I = M (which we will do for the remainder of the paper). We will use the notation 
D for derivatives in the x variable, and D for derivatives in the x variable, while G z , G zz , etc. 
denote derivatives in the scalar z variable. We also assume that G is C 2 in the sense that any 
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second order derivative in the variables x, x, and 2 which is mixed (i.e., DDG , or DG Z , etc) is 
continuous and that G z (x,x,z) < 0 for all (x,x,z). 

The inverse function theorem yields the existence of a unique function H(x,x,u ) such that 

G(x, x. H(x, x, u)) = u. 

H(x,x, •) is defined on some open interval (which may depend on (x,x)) with H u < 0, and H 
is C 2 in the above sense. Whenever we write an expression of the form H(x,x,u), it is with 
the understanding that u is in the range of G(x,x, •)■ 

As in [Trul4b], we require G to satisfy certain structural conditions. These assumptions will 
hold on a subset of the domain of G, denoted 0 (and fixed from now on), which has the form 

0 := {(x, x, z) G M X M x ! | z G /g(x, ®)} > 

where for each (x,x) E M x M the set Iq(x,x) is an open interval (possibly empty). Similarly, 
we will deal with the set 


t) := {(x, x, u) G M X M x M | u G Ih(x , x)}, Ih(x, x) := G(x, x, Ig(x , x)). 

The following condition is a relaxation of the (G5) condition presented in [Trul4b], and is also 
due to Trudinger [Trul4a]. 

Definition 4.1. A generating function G and bounded, open domains hi C M, flc M are said 
to satisfy uniform admissibility if there are constants —00 < u < u < 00 and 0 < K$ < 00 
for which, whenever (x,x,u) € ff cl X fi cl X (u, u), then 

(x,x,H(x,x,u)) G 0 , (Unif) 

\DG(x,x,H(x,x,u))\g x <K 0 . (Lipx 0 ) 

Remark 4.2. One elementary but useful consequence of (Unif) is that if G(x,x,z) G {u,u)^ 
then we must have (x,x,z) G 0 . Indeed this is immediate as if u := G(x,x,z), by definition 
H(x,x,u) = z. We will use this fact frequently. 


Definition 4.3. The function G is said to satisfy the twist conditions if for any (xo, xq, zo) G 
M x M x M we have the following 

(1) The mapping 


(x, z ) i-)- (DG(x 0, x, z), G(xo,x, z)) G T* 0 M x M, 


is injective on the set {(x,z) G M x M (xq,x,z) G 0}. 
(2) The mapping 

G z (x,x 0 ,z 0 ) 0 

is injective on {x G M \ (x,xq,zq) G 0}. 


(G- Twist) 


(G*-Twist) 


Although conditions (G-Twist) and (G*-Twist) may seem quite different, they are actually 
symmetric in nature. See Remark 9.5 for more details. 


Remark 4.4. For the sake of brevity, the arguments in expressions such as (DG(xo,x, z), G(x 0 , x, z)) 
and w iH be written simply as (DG, G)(xq, x, z) and —^-(x,x,z). 
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Definition 4.5. The function G is said to satisfy the nondegeneracy condition if given any 
triplet (x,x,z) E 0 , the linear mapping E(x,x,z ) : T X M — > T*M defined by 

DC 

E(x,x,z)V := DDG(x, x, z)V — (—— (x, x, z), V)DG z (x, x, z), V E T X M (G-Nondeg) 

G z 

is invertible. The adjoint operator of E(x,x,z ) (which is also invertible under the assumption 
(G-Nondeg)), will be denoted by E(x,x,z) : T X M —> T£M, so 

(E(x,x,z)V,V) = (V,E(x,x,z)V), \/VeT x M, V E T X M. 

Definition 4.6. We will use the notation 


Px,u( x ) : = DG(x, x , H(x, x , u)), 

, , dg _ 

Px,zi x ) '■ = - 7 —{X,X,z). 

^z 

Also if A C f2 and (x, z ) are such that (x, x, z) E 0 for all x E A, we will write 

[A] s , z :=p XjZ (A)cT?M. 

Likewise, if A and (x, u) are such that (x, x, u) E fi for all x E A, we will write 

[A]x, u :=p x ,u(A)cT*M. 


Definition 4.7. Due to (G-Twist) and (G*-Twist) there are differentiable maps 

expg 2 (-), expg u (-), (■,■), 

respectively defined on subsets of T£M,T*M, and T*M x M, by the system of equations 


(DG,G){x,ex.p^ u (p), Z^(p,u)) = ( p,u ), V (p,u) E (DG,G)({(x,x,z) \ (x,x,z) E 0 }) 


and 


DG ^ Z (p),x,z) =p, VpE ~^-({(x,x,z) | (x,x,z) E 0 }). 

Et 2 


G 


Note that by (G-Twist), 


Z x {p,u) = H(x,exp% u (p),u). 


G 


4.2. G-convex geometry. The coordinate systems given by Px,z(') and p x , u {') are of great 
relevance to the study of the generating function G (see also Lemma 4.30). Of special interest 
are those domains in D (resp. Cl) that correspond to convex sets in at least one of these 
coordinate systems. The same can be said for curves in 0 (resp. Cl) that correspond to a 
straight line segment in one of these coordinate systems. These ideas are recalled in detail 
below. 


Definition 4.8. A differentiable curve x(s) in M (s E [0,1]) is said to be a G-segment with 
respect to (, x, z) E M x M, if for all s E [0,1] we have that (x(s),x, z) E 0 and 

Px,z(x(s)) = (1 ~ s)ps j2 (x(0)) + spx,z(x( 1)). 

Likewise, a curve x(t) in M (t E [0,1]) is said to be a G-segment with respect to (x, u) E M x M, 
if for all t E [0,1] we have that (x, x(t),u) E b and 

Px,u{x(t)) = (1 - t)p XjU (x( 0)) + tp XjU (x( 1)). 
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Remark 4.9. If x(s) is a G-segment with respect to (x,z) with x(0) = xo, x(l) = xi, we 
will use the notation \xq,x\\ Sz for the image x([0,1]). Moreover, given xo, x\ G M, and 
(x,z) G M x M, by an abuse of notation we will write x(s) := [xo,x\]^ z to signify that x(s) 
is the (unique) parametrization of a G-segment given in the above definition with x(0) = xo, 
x(l) = x\. Additionally, when we say x(s) is well-defined it specifically denotes that for all 

s G [0,1], (1 — s)px, z (x o) + s'Px.z(xi) lies in the image |— ^p-{x, x, z) \ x G M, (x,x,z) G gj. A 
similar remark holds for G-segments x{t) in M. 

Remark 4.10. Fixing local coordinates in M and M, the matrix representation of E(x,x,z ) is 

G G-i 

771 w x % Z^* Xl 

^ij ~ { ^ r x i xi q • 

A routine calculation then shows that the derivatives of the maps x i->- Px,z(x) and x 
Px,u{x) are given by anc j E(x, x, H(x, x, u)) respectively, hence these mappings are 

G 1 -diffeomorphisms in a neighborhood of wherever (G-Nondeg) holds (i.e., near x such that 
(x,x,z) G 0 for Px,z(') an( l near x such that (x,x,u) G tl for p x , u {'))- In particular, this implies 
that G-segments are differentiable as long as they are well-defined. 

We also make some convexity assumptions on the domains 14 C M and 0, C M. 

Definition 4.11. We will assume that for any x G 14 cl , 

u G (u,u) =>• [Q]j; iU is convex. (DomConv*) 

Also suppose xo, xi G 14 cl , x G fl cl , and zeR with G(xo,x,z) G (u,u). Then we assume that 
kl is path-connected and 

(x 0 ,x,z), (xi ,x,z) G 0 

=> x(s ) := [xo,xi]g z is well-defined and [xo,xi] iz C fl cl . (DomConv) 

The next proposition computes the velocity of a G-segment in terms of the linear maps 
E(x, x, z) and E(x, x, z) 

Proposition 4.12. Let x(s) be a well-defined G-segment with respect to some (xo,^o) 5 x(t) a 
well-dehned G-segment with respect to some (xo,uo), and let 

z{t) := H(x 0 ,x(t),uo). 

Then, using the notation p s := p So>20 (x(s)) and p t := Px 0 ,u 0 (x(t)), we have the expressions 


x(s) = —G z (x(s),xo, zo)E 1 (x(s),x 0 ,z 0 )(pi -po), 
x(t) = E~ 1 (x 0 ,x(t),z(t))(p 1 - po), 

DC 1 

z{t) = (——(x 0 ,x(t),z(t)),x(t)). 

^z 


Proof. Differentiating the identity —^(x(s), xo, * 0 ) = (1 — s)po + sp\ in s yields 

--DDG) (DG Z , x(s))DG 

x(s) +-- =P 1 - Po, 


(4.1) 

(4.2) 

(4.3) 


G 
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where all expressions are evaluated at (x(s),xq,zo). Therefore, for an arbitrary V E T So M and 
s E [0,1], 


(Pi ~Po,V) = 


r ( (l - -T (([DDG] V,x(s)) - {DG z ,x(s))(^,V)) 

G z (x(s),xo,z 0 ) V G z ) 

n / / ! - - t{E(x{s),x 0 ,z 0 )V,x{s)) 

G z (x(s),xo,z 0 ) 


and (4.1) follows. Similarly, differentiating the identities 

DG(x 0 ,x(t),z(t)) = (1 - t)p 0 + tp 1} 
G(x 0 ,x(t),z(t)) = u 0 , 


in t we obtain 


[DDG] x(t) + DG z z(t ) = pi - po, 

{DG, x(t)} + G z z(t ) = 0, 

where all expressions are evaluated at (xo,x(t),z(t)). Rearranging the second line above and 
using that G z / 0 yields (4.3). We can then substitute (4.3) into the first line above to obtain 

DG 

Pi ~ Po = [DDG] i(t) - DG z (——,^(t)) 

Z 

= E(x 0 ,x(t),z(t))jc(t). 


Since E(xq, x(t), z(t)) is invertible by (G-Nondeg), the formula (4.2) follows. 


□ 


The last two conditions on the generating function G are as follows. 


Definition 4.13. We say G satisfies (G-QQConv), if for any compact subinterval [uq,uq] C 
(u,u), there is a constant M > 1 with the following property: take any xq, x\ E fi cl , x\, 
x 0 E U cl , zo E M such that G(x(s), xq, zo) E [uq,uq] for all s E [0,1] where x(s) := [xo-,x{\^ o . 

Then if z\ := H(xo,xi,G(xo,xq,zq)), it holds 

G(x(s), x\,z\)) - G(x(s), x 0 , zq) (G-QQConv) 

Ms 

< - - - j (G{xi,xi,H{x{s'),xi,G{x(s'),xo,zo))) - G(xi,x 0 , z 0 )), 

1 — s' 

for any s E [0,1] and s' E [0,1). Likewise, G satisfies (G*-QQConv) if for any compact 
subinterval [uqjUq] C (u, u) there exists a constant M > 1 such that: whenever xq E U c1 , xq, 
xi E f2 cl , uo E [uq,uq], and x\ E M with (x\, x(t), H{xq, x(t), uo)) E g for all t E [0,1], (where 
x(t) := [x 0 , x\] X0 UQ ). it holds for any t' E [0,1) that 

G(xi,x(t),H(xo,x(t),uo)) - G(xi,xo,H(xo,xo,uq)) (G*-QQConv) 

< Y ^~ t 7 [G{xi,xi,H(xo,xi,u 0 )) - G(x l ,x(t'),H(xo,x(t'),uo))] + . 

If G satisfies both (G-QQConv) and (G*-QQConv), we say that G is quantitatively quasiconvex. 

We note that due to assumptions (Unif), (DomConv), and (DomConv*), in the above defi¬ 
nitions both G-segments x(s) and x(t) are well-defined and remain in Q cl , fl cl respectively for 
all s, f 6 [0,1], 
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4.3. G-convex functions. 

Definition 4.14. A real valued function u defined on H is said to be G-convex if for any xq £ H 
there is a focus (xo, zq) £ Cl x M such that (xo, xq, zf) £ 0 and 

u(x o) = G(x 0 ,x 0 ,z 0 ), 
u(x) > G(x,xq, zo), V x £ fb 

Any function of the form G(-,xq,zo) will be called a G-affine function, and if it satisfies the 
above conditions we say it is supporting to u at x o- 

We remark here that by (G-Twist) it is clear that if G(-,xo,zq) is supporting to u at xq, we 
must have zo = H{xq,xq, zq). Also note that in the definition above, it is not assumed that 
(x,xq, zo) £ g for all x £ fi, but only for xq. This distinction will motivate further definitions 
below. 

Definition 4.15. Let u be a G-convex function and x £ Q. We define the G-subdifferential of 
u at x as the set-valued mapping 

d G u (x) := {x £ fi | 3 z £ M s.t. G(-, x, z) is supporting to u at x} . 

For x 6 D 0 , we define 

d G u (x) := < lim Xk \ Xk £ d G u (xk ), fi 3 Xk - > x 

I k—foo /c—>oo 

Also, for any A C U cl , we define 

d G u(A) := |J d G u(x). 

xeA 

If x £ we will say G(-,x, H(x, x, u(x))) is supporting to u at x only when x £ d G u (x). 
With this notion in hand, we are now able to define an appropriate weak notion of solutions 
to the generated Jacobian equation (GJE), which will allow for measure valued data. 

Definition 4.16. Let ^bea positive Borel measure defined on fb We say a G-convex function 
u on Q is an Aleksandrov weak solution of the generated Jacobian equation if for any Borel 
measurable 4 C fi we have 

\dcu(A)\ c = n(A). 

We recall that \dcu(-)\ c is a Radon measure (see [Trul4b, Section 4]) known as the G- 
Monge-Ampere measure of u. 

Remark 4.17. In this paper, we are concerned with the specific case corresponding to equation 
(GJE) when the function if> G on the right hand side is bounded away from zero and infinity. 
Thus, in the sequel we will say G-convex function u on kl is an Aleksandrov solution of (GJE) 
(with bounded right hand side) to mean there exists a constant A > 0 such that 

A ^ 1 | A n Ho|£ < | d G u {A)\ c < A \A n , any Borel set A C fb 

Here Ho is the support of \d G u{-)\^. 

Definition 4.18. We say that a G-convex function u is nice (in kl) if u < u < u on H cl . 

We also say a G-convex function u is very nice (in Q if every G-affine function supporting to 
u in H cl is nice (thus in particular, u is also nice). 




20 


N. Guillen and J. Kitagawa 


Remark 4.19. If u is a nice G-convex function, (Lip Ao ) combined with a standard argument 
implies u is locally bounded, and also locally Lipschitz (and in particular continuous) in fl cl . 

As a result, a nice G-convex function is differentiable a.e. on f2. 

Indeed, fix any point xq G Q. Since u is nice then u(xq) G {u + e,u — e) for some e > 0, small 
enough that B e / Ko (x o) C II (where Kq is the constant in (Lip Ao )). We first claim that 

u(x 0 ) - e < G(y,x,H(x 0 ,x,u(x 0 ))) < u(x 0 ) +e 

for all x G fi cl and y G B t / k 0 {xq)- Indeed, let y G B e / K0 (xo) cl and write y g (s) for the unit speed 
minimal geodesic from xq to y (we may first shrink e to ensure such a minimal geodesic exists 
for every point within the boundary of the ball). Fix an arbitrary x G fi cl and define 

s* := sup {s** G [0, d g (x 0 , y)} \ G(y g (s), x,H(x 0 , x, u(x 0 ))) G (u(x 0 ) - e, u(x 0 ) + e), V s G [0, s**]} . 

If s* = d g (xo,y), we are done. Otherwise, we must have G(y g (s*),x, H(xq,x,u(xo))) equal to 
either u(x o) — e or u(x o) + e, thus by (Lip Ao ) we can calculate 

e = \G(y g (s*),x,H(x 0 ,x,u(xo))) - it(xo)| 

(DG(y g (s),x,H(x 0 , x, u(x 0 ))),y 9 (s))ds 

< Ka s* < K 0 d g (xo,y) 

which is a contradiction, thus we obtain our first claim. 

Now take any y G B € / Kq (x o) and let y G Oqu (y), then we have G{x o, y, H(y , y, u(y))) < u(x o), 
thus combined with the above bound 

u(y) = G(y, y, H(y, y, u{y ))) 

= G(y, y, H(x 0 ,y, G(x 0 , y, H(y, y, u(y))))) 

< G(y, y, H(x 0 ,y, u(x 0 ))) < u(x 0 ) + e. 

On the other hand, if xq G Squ (xo), we see that u(y) > G(y, xo, H(xo,xo, u(x o))) > u(x o) — e, 
thus we find that u G (u(x o) — e,u(x o) + e) C (u, u) on B e / Ko (x o) cl , i.e. u is locally bounded in 

n. 

By following the same line of proof as above, we can see that for any yi, y G B € / 2 k 0 ( x o) c1 and 
yi G dcu(yi), we have G(y,yi, H(yi,yi,u(yi))) G (u,u). Then by choosing Af to be a small 
enough geodesically convex neighborhood of .To contained in B e / 2 K 0 ( x o) cl , by (Lip A ' Q ) we find 
for any yi, y 2 G Af, 

u(yi) - u{y 2 ) < G(yi,y 1 ,H(y 1 ,y 1 ,u(y 1 ))) - G(y 2 ,yi, H(y 1 ,y 1 ,u(y 1 ))) 

< K 0 d g (y 1 ,y 2 ). 

By a symmetric argument, u is locally Lipschitz in J7. 

Remark 4.20. If u is a very nice G -convex function, there exists a compact subinterval [u N , un] C 
(u,u) of such that u N < m < «n on fl cl for any G-affine function m, supporting to u in II. 
Indeed, note that 

sup | m(y) | y G fi cl , m is G-affine and supporting to u in H cl | 

= sup \ G(y, x, H(x, x, u(x))) \ y, x G II cl , x G dcu ( x ) > 
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and as u is very nice (by Remark 4.19 above, u is continuous on A2 cl ), the constraint set in the 
second line is clearly compact. A similar argument holds for the infimum. We will refer to this 
subinterval as a very nice interval associated to u. 

Remark 4.21. One of our ultimate goals is to apply Theorems 2.1 and 2.2 toward regularity 
of weak solutions of (GJE) (see [Trul4b, Section 4] for a definition and discussion). However, 
when (u,u) ^ M in (Unif), we may only be able to apply our estimates Theorems 2.1 and 2.2 to 
a very nice G-convex function u. This is to be expected as one feature of this case is that weak 
solutions of (GJE) with the same data may have differing regularity (see Sections 3.1-3.3). 

The following adaptation of the condition (G5) in [Trul4b] due to Trudinger (also shared 
with us through personal communication [Trul4a]) gives existence of weak solutions of (GJE) 
that are very nice. Indeed, define 

dn(x i,s 2 ) : = inf {^( 7 ) I 7 C H cl , 7 ( 0 ) = 37 , 7 ( 1 ) = s 2 , 7 piecewise G 1 j , 
diamn(fi) : = sup dn(si,s 2 ), 

Xi, 

(here L g { 7 ) above is the length of a piecewise G 1 curve in ( M,g )). Then assume that the 
constant Kq in (Unif) satisfies Kq < ■ Then writing I\\ := Kosup xe ^dci(xo,x), for 

any measurable, bounded data, xq G H, and uq G (u + K\ , u — K\), there exists a nice weak 
solution u of (GJE) with u(x 0 ) = uq (see [Trul4b, Theorem 4.2]). If uq G (u + 3K\,u — 3K±), 
u will be very nice', the argument is similar to the one in Remark 4.19. 

The next notion is that of the G-dual of a set A C H cl . 

Definition 4.22. Let A C H, x G A mt , A > 0, and m be a G-afhne function. We define the 
G-dual of A with vertex x, base rn, and height A by 

A x,m,\ := jx G H cl | G(y, x, H(.x , x, m(x))) < m(y ) + A, V y G A j . 

In other words, x G A^ m A if and only if there exists some 2 such that 

G(x, x, z ) = m(x) and G(y, x, z ) < m(y) + A, V y G A. 

The following Propositions 4.23 and 4.28 make essential use of the conditions (G*-QQConv) 
and (G-QQConv). 

Proposition 4.23. If u is a nice G- convex function, then [dcu (a?)] x ,u(x) convex f° r an Y 
iGSl. 

If A C H cl is connected and m is a G-affine function with u < m < u on A cl , then 
is convex for any 0 < A such that sup^ m + A < u and x G A mt . 

Proof. Begin by fixing iGU and xo, xi G Oqu (x). We let 

x(t) := [s 0 , Xi\ x , u ( x ), z(t) := H(x, x(t),u(x)), 

and define 

P(y) ■= sup G(y,x(t),z(t)) 
te [ 0 , 1 ] 

for any y G H. Note since u is nice, x(t) is well-defined and contained in H cl by (DomConv*). 
Also as a result, by (Lip A - 0 ) we see p is continuous on H. Now consider the set 

Vt' ■.= {y G n | pipy) < u(y)} . 
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Clearly x G U 7 , and U 7 is relatively closed as a subset of U. We now aim to show that U 7 
is relatively open, then we would obtain U 7 = U since U is connected by (DomConv). Since 
u(x) = G(x,x(t), z(t)) for all t G [0,1] by construction and u is nice, (Unif) implies that 
(x, x(t), z(t)) G g for all t G [0,1]. As a result, we would have [xo,xi\ xu r x \ C d G u(x), proving 
the proposition. 

Note that since fl cl is compact and u is nice, there exists some e > 0 such that u+e < u < u—e 
on fi d . Suppose that yo G U 7 ; thus p(yo) < u(yo) < u — e. By continuity of p, there exists 
(5 > 0 such that p(y) <u — e/2 for all d g (y, yo) < 6. Fix such a y, we claim that p(y) < u(y) 
as well. If p(y) < u + e, the claim is immediate. Otherwise let [to,fi] C [0,1] be the maximal 
subinterval on which G(y, x(-), z(-)) > u + e that also contains a value t y G (to,t\) where 
G(y, x(-), z(-)) is maximized; by possibly reversing the parametrization of x(t) let us assume 
G(y, x(to), z(to)) > G(y, x(ti), z(ti)). Thus for any t G [toTi] we have G(y,x(t), z(t)) G (u,u), 
and in turn by (Unif), (y,x(t), z(t)) G 0 . As a result we can apply (G*-QQConv) to the 
reparametrized G-segment x(t) := x((l — t)to + tt\) to obtain 

G{y, x(t y ), z(t y )) < G(y, x(t 0 ), z(t 0 )) + ^ [G(y, x(h), z(ti)) - G(y, x(t 0 ),z(t 0 ))] + 

t i — to 

= G(y,x{t 0 ),z(t 0 )) < u{y) 

as desired (the constant M here actually depends on the specific value of u(x), but it clearly 
does not affect the final inequality). This last inequality is due to the fact that G(-, x(0), z(0)) 
and G(-,x(l), z{ 1 )) are supporting to u from below (in the case to = 0 ), while u + e < u(y) (in 
the case to > 0 ). 

To obtain the second claim, repeat nearly the same proof with xq, x\ G A , using x(t) := 
[xo,xi\ xMx) , z(t ) := H(x,x{t),m{x)), and U 7 := {y G A \ p(y) < tn(y ) + A}. □ 

Corollary 4.24. Suppose u is a nice G -convex function as above. If m(-) = G(-,x,z) is a 
G-affine function such that m(x o) = u(x o) and m < u in some neighborhood of ro G O, then 
x G d G u(x 0 ). 

Proof. Suppose m is such a G-affine function, locally supporting from below at a point xq G U. 
Recall the subdifferential of u at xq, 

du{x 0 ) = {pG Tf 0 M | u(exp XQ v) > u(x 0 ) + (p,v) + o(|v| fla!0 ), v -4 0} 

is a closed convex subset of Tf Q M, compact since u is nice; here exp XQ is the usual Riemannian 
exponential map. We pause to remark here that since G is not assumed to be G 2 in the x 
variable, u may not be semi-convex; however since it is G-convex, it is easy to see that du{x ) 7 ^ 0 
for any iGll. By our current assumptions, Dm{x 0 ) G du(x 0 ). Our goal will now be to show 
that du(x 0 ) = [d G u (xo)] Xo ,u(xo)> wou ld conclude the corollary as x = exp XQ u ^ c ^(Dm(x 0 )) 

by (G-Twist) (recall, since u is nice, by (Unif) we have (xo,x,z) G 0 ). 

To this end, let po be an exposed point of du(x 0 ), i.e. for some unit length vq G T Xq M , 

( P~Po,v o )<0 , V p G du(x 0 ) \ {po} • (4.4) 

We will show that (xo,po) is a limit in T*M of (xk, Du(xk)) for some sequence Xk —>■ xq- If this 
were the case, since u is nice, by (G-Twist) and (Unif) we can see that jexp(' u ^ Xk ^(Du(xk ))| = 
d G u(xk ) for each k. Then by continuity of G and u, we have that exp X(j u ^ XQ ^(po) € d G u(x 0 ), 
thus we could conclude that any exposed point of du(x 0 ) is contained in [d G u (.'Co)]a: 0 ,u(x 0 )- 
Since by [Roc70, Theorem 18.7], du(x 0 ) is the convex hull of its exposed points, combining 
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with Proposition 4.23 we would obtain du(x o) C [d G u (xo)\ XO:U ( xo y The reverse inclusion is 
immediate, hence this would complete the proof. 

Now by Remark 4.19, u is differentiable almost everywhere, hence we can choose a sequence 
v k G T Xo M such that u is differentiable at x k := exp^ v k while - > vo, and (x k , Du(x k )) 

converges to (xo,Poo) i n T*M for some p^ G T* Q M. In particular, 

u(xk) > u(x 0 ) + (po,Vk) + o(\v k \g X0 ), 

u(x 0 ) > u{x k ) + (Du{x k ),exjp~l x 0 ) + odexp” 1 x 0 \ 9xk ). 

Plugging the second inequality above into the first, canceling terms, and dividing both sides by 
\ v k\g XQ , we obtain 

0 > (po, ^ ) + <■ Du(x k ), X ° ) + \vk\g xo {o(\v k \ gxo ) + o(|expJ fc 1 x 0 | ai! J). 

By using geodesic normal coordinates around xq, we find taking k —> oo that this leads to 

0 > (po - Poo, vo). 

However, by continuity, exp^ u ( Xk )(Du(x k )) —> Xoo for some Xoo G d G u (xo). Since u is nice, by 
(Unif) we must have (xo,Xoo,H(xo,Xoo,u(xo)) G g, thus we see that 

poo = DG(x 0 , Xoo, H(x 0 , Xoo,u(xo))) G du(x 0 ), 

and by (4.4) we must have po = p ^ as desired. □ 

Definition 4.25. Suppose u is a nice G- convex function, m is G-affine, and let S := {x G H | 
u(x) < m(x)} with xo G S' 111 *. Then the G-cone with base S, vertex xq, and height m(xo) — u(xo) 
is the function defined by 

K x 0 ,s( x ) '■= sup {G(x,x,H(x 0 ,x,u(x 0 ))) | x G U, G(y,x,H(x 0 ,x,u(x 0 ))) < m(y), V y G 5} . 

Remark 4.26. Since u is G-convex, clearly s {xq) = u(xo). Now s may not be G-convex 
on H (given x G fi, it is not clear that there exists an x G 0 for which (x, x, H(x, x, K xq s (®))) e 
g). However, we can see that since u is nice, by (Unif) we have at the vertex xo, 

d cKx 0 ,S ( x o) = {x G Cl | G(y,x,H(x 0 ,x,u(x 0 ))) < m(y ), V y G 5} / 0. (4.5) 

Also note, as long as u is nice the proof of Proposition 4.23 yields that [d G K XQ s (£o)]a; 0 ,u(xo) is 

convex. 

Lemma 4.27. Suppose u, m, xo G S mt are as in Definition 4.25, and suppose S C H mt . Then 

daK^s (s 0 ) C d G u (S ). 

Proof. Fix x G d G K XQ s (xo) and define 

^max := ma xH(x,x,u(x)), 

x£S 

then Z max = H(x max , x,u(x max )) for some x max G S cl ; since u is nice, by (Unif) it follows that 
(a; m ax, x, £max) G g. Since Z max > H(x,x,u(x)) for all x G S and G z < 0, it follows that 

G(x, x, Z mav ) < G(x, x, H(x, x, u(x))) = u(x), V x G S, 

Xmax, X, Z m ax) — wf^max) • 


while 
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Now if x max E S 9 , we can calculate (recalling that x E dcK^ 0 s (xo)) 

G(x max ,x, H(x 0 ,x,u(x 0 ))) < m(x max ) 

— tt(Xmax) — G(x max , X, Z max ) 

by the definition of z max . Then applying H(x max ,x, •) to both sides, we have 

H(x 0 ,x,u(x 0 )) > ^maxi 

in other words we may actually choose x max = xo- Thus in any case, we may assume x max E 
5 mt ; then G(x, x, z max ) locally supports u from below in S. In particular, x G dcu(S) by 
Corollary 4.24. □ 

Proposition 4.28. Suppose m(-) := G(-,x,z ) is a nice G-affine function, and let S := 
jx E D cl | u(x) < m(x)}. Then [<S]x,z is convex. 

Proof. We remark that since m is nice, (Unif) implies [fl cl ]x,z is well-defined; in turn (DomConv) 
implies it is convex. 

Fix any arbitrary G-affine function m(-) = G(-,x,z), and let S := {x G I4 cl | m(x) < m(x)}. 
Consider xo, xi G S , and let x(s) := [xo,xi] Sz ; again since m is nice, (Unif) and (DomConv) 
implies x(s) is well-defined and remains in D cl for all s G [0,1]. 

Now suppose 

G(x\,x,H(xq, x, m(xo))) > G(xi,x,H(xq,x, m(x o))) = m(xi). 

Clearly the expression on the left is in the domain of H(x i, x, •), while m(xi) is as well since m 
is nice. Thus we can take G(xo, x, H(x±, x, •)) of both sides (which preserves monotonicity), to 
obtain 


m(x 0 ) = G(xo,x,#(x 0 ,x,m(x 0 ))) > G(x 0 ,x,i7(xi,x,m(xi))), 
thus by possibly relabelling xq and xi, we can assume that 

G(xi, x, H(xo, x, m(xo))) < m(x i). 

Now since m is nice, u < info m < sup^ m < u. Thus we may apply (G-QQConv) along x(s) 
with [«q,Fq] = [info m, supo m] (also with some associated constant M > 1). Doing so we find 
that 


m(x(s)) = m(x(s)) + Ms [G(xi, x, H(x o, x, m(xo))) — m(xi)] 

> G(x(s),5, H{x 0 ,x,m(x 0 ))) 

> G(x(s), x, H(xo, x, m(xo))) = m(x(s)). 

Here the inequality in the last line is due to the fact that m(xo) < m(x o), combined with 
monotonicity properties of H and G in the scalar parameters. As a result, we see that is 

convex. 

Finally note that, it = supm for some collection of G-affine functions m. Thus we can see 
that which by the first part of the proof is an intersection of convex sets and 

must be convex itself. □ 
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4.4. G and the Riemannian metric. From this point through the end of Section 8 , we 
assume that G satisfies (G-Twist), (G*-Twist), (G-Nondeg), and (G-QQConv), (G*-QQConv), 
and let u be a very nice G -convex function with associated very nice interval [u N , %] C (u, u). 


Remark 4.29. By an abuse of notation, we will often refer to a very nice constant , by which 
we mean a constant that depends on [u n ,?Zn]> the domains fl, U, the dimension n, and the 
constant Kq in (Lip^) through the following quantities: the modulus of continuity of E and 
E~ l , sup |det.E| , sup |det£'| ±1 , sup1111^ 1 , sup||£ l || ±1 (|| ■ || being the Hilbert-Schmidt norm 
of the matrix), inf|G z |, sup|G 2 |, inf| H u |, sup|77 u |, and M > 1 corresponding to [u n ,un] from 
(G-QQConv) and (G*-QQConv). All suprema and infima above are taken over 2 ; E O, r S 0, 
u G [u n ,un], and with the understanding that z = H(x,x,u)\ the above quantities can be 
assumed finite and nonzero by (G-Nondeg) and (Unif). The rationale for this terminology is 
that in various situations, fl, fi, n, and the various quantities involving G and H are fixed, 
with the only real dependence on the constant coming from the range of the scalar parameter u 
which will be constrained in the interval [u n ,un]; since we generally fix one very nice function, 
the interval [u n ,«n] will be fixed as well. 

Lemma 4.30. If (x, z ) G fl cl x M satisfies the condition 

G(-,x,z) G [u n ,«n] on all of IT 1 , (4.6) 

then Px,z{') is a bi-Lipschitz mapping from IT 1 to z . Moreover the Lipschitz constants of 

both this map and its inverse are bounded by some very nice constant. 

Similarly, if (x,u) G fi cl x [u n ,5n], thenp XjU (-) is a bi-Lipschitz mapping from U cl to [fT 1 ]^, 
and the Lipschitz constants of p x ,u{ m ) and its inverse are bounded by a very nice constant. 


Proof. Before we begin, recall the definitions of do, and L g introduced in Remark 4.21. Fix 
(x,z) G IT 1 x M satisfying (4.6). By (Unif), we then have (x,x,z) G 0 for any x G IT 1 . Fix xi, 
X2 G IT 1 , then by (DomConv) the G-segment x(s) := [ 21 , 2 : 2 ^ is well-defined and remains in 
IT 1 , in particular it is differentiable for all s G [0,1]. 

Then by (4.1) we calculate 

L g ([xi,x 2 \ StZ )= / \-G z (x(s),x,z)E~ 1 (x(s),x,z)(p S; ,z(x 2 ) - Px,z{xi))\ 9xM ds. (4.7) 
•J 0 

Now since G{x\,x,z) G [u n ,un], we see that —G z (x(s),x,z) has very nice, positive upper and 
lower bounds, while the operator norms of E^ 1 (x(s), x, z) and E(x(s),x, z) also have very nice 
upper bounds. Thus we see for some very nice G > 0, 


C I Px,z(x 2 ) ~Px,z{x l)|g s < Lg([x 1,X 2 \ S , Z ) < C\px,z{x 2 ) ~Px,z{xi)\g s - 
Clearly we always have d g (x \,x 2 ) < L g {[x\, x 2 ]^ z ), so this implies that expf^(-) is globally 
Lipschitz on [U cl ] Si2 with a Lipschitz constant that is very nice. 

We now prove px,z(r) is globally Lipschitz from IT 1 to [U cl ] Sj2 , and its Lipschitz constant is 
bounded by some very nice constant. Indeed, first note that this mapping is G 1 on IT 1 , with 
C 1 norm bounded by a very nice constant, thus it is sufficient to show there exists a very nice 
constant G > 0 such that for any 2:1, 2:2 G IT 1 , there is a piecewise G 1 curve 7 connecting x 1 to 
X 2 , remaining entirely within U cl , for which Cd g (xi,x 2 ) > L g ( 7 ). Suppose this is not the case, 

then there is a sequence xl, x? for which 0. By compactness of I7 cl , we can assume 

x\ and x\ converge. By (4.7), we see that dn(x\,x\) has a uniform, very nice upper bound, 
hence it must be that d g {x\,x^) —> 0. This implies that both x\ and converge to some 
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Xoo E U cl , while by continuity of Px,z( m ) on D cl , we must have Px,z{x\) and Px,z{x\) converging 
to some Poo E [D cl ] i)2: . Additionally, note that since Px,z{') has a very nice upper bound on 
its C 1 norm, it is locally Lipschitz in D mt with a very nice constant. Thus if Xoo E D mt , by 
combining with (4.7) we would have for large enough k, 


d g{ x h x l ) > d g( x h x l) 


> 


d g{ x b x k ) 


M x h x l) L g(i x l.*2]x,a) C\px, z (x 2 ) - Px,z( X l)\g. 


>C, 


9x 


a contradiction. Thus it must be that the limiting points Xoo E ft 9 and Poo E \ft 9 ]x,z- 

At this point we make an aside to show that the domain ft has a Lipschitz boundary in the 
sense that any point in ft 9 has an open neighborhood (in M) on which it can be represented as 
the graph of a Lipschitz function in some local coordinate system, where the Lipschitz constant 
of this function is uniformly bounded. Since ft 9 is compact, it is clearly sufficient to show 
the boundary is locally Lipschitz. Fix a point x € ft 9 , and a small open neighborhood O 
of x in M. By the extension lemma (for example, [Leel3, Lemma 2.27]) there exists a C 1 
extension of Px,z( m ) to O. Since the derivative of Px,z(') is invertible on U cl by (G-Nondeg), 
by possibly shrinking O we can assume that this extension is also a C 1 diffeomorphism on O. 
We continue to use the notation exp!? 2 (-) to refer to the inverse of this extension. Now take 
a ball B C T|M centered at p := Px,z(x), small enough so its closure is contained in \Q\x,z- 
Then B n [U]^ is open, [0\x, z is an open neighborhood of the closure of B n [U]^, and p is 
contained in the boundary of B n Moreover, B n is convex by (DomConv), hence 

has a locally Lipschitz boundary. Thus we can apply [HMT07, Theorem 4.1] to find that ft 9 is 
locally Lipschitz near x, finishing our aside. 

Finally, we return to our main argument. Fix local coordinates near x^, using these coor¬ 
dinates we identify a neighborhood of Xoo with a subset of M n . By the aside above, we find a 
neighborhood on which D cl is written as the graph of a Lipschitz function <L, over some sub¬ 
set of M n_1 in these coordinates. If k is large enough, then xl and xl are contained in this 
neighborhood. We now define a special curve 7*.. Draw a straight line segment between 
and x?. If this segment does not intersect fl 9 , then we take jk to be this segment. Otherwise, 
between the first and last points where the segment intersects ft 9 , take 7^ as the image under 
of the projection of this line segment onto M"^ 1 . Clearly 7*. is then a Lipschitz curve with 
L g (7/c) < ^ d g{ x k^ x k) f° r some constant C > 0 depending only on the domain D (independent 
of k). In turn, this implies a bound d^i{x\,x |) < Cd g (x\,xfy on the intrinsic distance, thus we 

cannot have 9 V . 1 ’ j; —> 0, finishing our proof. 

“01,27,27) _ 

The statement concerning ft is proven similarly, but using (4.2) and (4.3) in place of (4.1) in 
obtaining the analogue of (4.7), and(DomConv*) in place of (DomConv) in various places. □ 


The above lemma immediately yields the following corollary. 

Corollary 4.31. Let us write a ~ b to mean there exists a very nice constant C > 0 for which 
C~ l a < b < Ca. Then under (4.6), we have 

dg(xi,x 2 ) ~ \Px,z(xi) - Px,z{x2)\g s , Vxi, x 2 E D cl . (4.8) 

Also for any (x,u) E D cl X [rt N , mn], 

dg(x!,x 2 ) ~ \px,u(xi) - Px,u(x2)\g x , Vxi, x 2 E D cl . (4.9) 






Pointwise estimates and regularity in geometric optics and other generated Jacobian equations 


27 


Finally, in each of the respective situations above, we have 

\A\c ~ I 5 

\A\ c ~\[A] XjU \ c (4.10) 

for any measurable A C 14 cl or A C 14 cl . 

Finally, we present a lemma relating the difference of two G-affine functions with the differ¬ 
ence of their linearizations. The lemma relies on (G*-QQConv) in a crucial way. 

Lemma 4.32. Let xq G 14, xq, x\ G 14, uq £ [u n ,un]> zq := H(xo,xo,uo) for i = 0,1, and 
x(t) := [xo>®i] . Then there exists a very nice constant C > 0 such that for any x G 14 

satisfying (x, x(t), H(xq, x(t), uq)) G g for all t G [0,1], we have 

(Px 0 ,zo(x) - Px 0 ,zo(xo),E~ 1 (x 0 ,xo,zo)(p Xo>Uo (x 1 ) - px 0 ,uo(xo))) 

< [ G ( x ,xi, H(x 0 ,xi,uo)) - G(x,x(t'),H(x 0 ,x(t'),uo))] + , V t' G [0,1). (4.11) 

Additionally, for any x G 14 such that x(s) : = [xo,x] s Zg is well-defined and contained in 14 cl , 
we have 


| G(x, x(t), H(x 0 , x(t),u 0 )) - G(x, x 0 , z 0 )\ 

— Gt\px 0j z 0 (x) — Px 0 ,Zo (iCo) Isojq \Pxo,Uo{x\) — Pxo,uo(xo)\g xo , V t G [0,1]. 
Proof. To obtain the first inequality we first calculate (also using (4.2)): 
d 


(4.12) 


—G(x,x(t),H(xo,x(t),u Q )) 


t =o 


= ( DG(x,x 0 ,z 0 ) + G z (x,x o ,z o )DH(x o ,x o ,u 0 ),x(0)) 

DG 

= G z (x,x 0 , z 0 )(-^-(x, x 0 , z 0 ) + DH(x o ,x o ,u o ),x(0)) 

Z 

DG DG 

= -G z (x,x 0l z 0 )(- — (x,x 0 ,z 0 ) + — (x 0 ,xo,zo),E~ 1 (xo,xo,z 0 ){p Xo , Uo {xi) - p XQ}Uo (x 0 ))), 

&Z z 

and note —G z (x, xq, zo) = —G z {x, xo,H(xo, xq, uq)) has strictly positive upper and lower bounds 
that are very nice. The inequality (4.11) then follows by dividing (G*-QQConv) through by 
t > 0 and taking the limit as t \, 0. 

The inequality (4.12) follows by a simple but tedious calculation: 

G(x, x(t), z(t)) - G(x, X 0 , Zo) 

= G(x(l),x(t), z(t )) — G(x{l), x(0), z(0)) -I- G(x(0),x(t),z(t)) - G(x(0), x(0), z(0)) 

= J () J~ s x(t), z (t)) - G(x(s), x(Q), z(0))) ds 

= f (DG(x(s), x(t), z(t)) — DG(x(s),x(0), z(0)), x(s))ds 
Jo 

= Jo Jo ^^ DG ^ s ^^ t '^ z ^ t '^' ± ^ dsdt ' 

= f f (DDG(x(s), x(t'), z(t'))x(t') + DG z (x(s), x(t'), z(t'))z(t'), x(s))dsdt'. 

Jo Jo 
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Now if we write 

P := Pxo,zo { X )l Po - = Pxo,zoi X o) y 
Pi := Pxo,uo (®l)j Po '- = Pxo,uq{xo) 1 

by Proposition 4.12 (4.1), (4.2), and (4.3), the final expression in the calculation above can be 
written as 


i 

(M t ’ >s {p i ~Po),p-po) + (V t ' j8 ,p-po)(V t >,pi - p 0 )dsdt' 

for some linear transformations M t ^ s : T* Q M —> T X M , and vectors Vt> t3 E T X M, V# E T Xq M. 
As uo E [u n ,un]) a routine calculation yields that |V^'| g , and the operator norm of 

M t j s have very nice bounds, thus by applying Cauchy-Schwarz we obtain (4.12). □ 



5. An Aleksandrov-type estimate 

Definition 5.1. If A C T?M is convex and u E S n_1 C T|M is a unit direction, we denote 
the supporting plane to A with outward normal u by 11^. 

We also recall the standard notion from Riemannian geometry of the musical isomorphism 

Definition 5.2. If v E TgM for some x E M. then we define v ® E T X M implicitly by the 
relation 


(v^,w) = g x (v,w ), V w E T|M. 

The map f : TJM —> T X M is called the musical isomorphism. 

Remark 5.3. We also recall the following very simple elementary formula for the distance from 
a point in a set to a supporting plane of the set: if A C T|M is convex, po E A, and v E T£M 
is unit length for some x E M, then 

d(Po,n^) = swpg x (v,p-p 0 ) = sup g s (v,p-p 0 ). 
p&y i p&a 8 

Theorem 5.4 (John’s Lemma). If A C is a convex set with nonempty interior, there exists 
an ellipsoid E whose center of mass coincides with that of A, and a constant a(n) depending 
only on n such that 


a(n)E C A C E. 

In this section, we assume the hypotheses of Theorem 2.1. Namely, we fix a very nice G- 
convex function u with associated nice interval [u N , %], a nice G-affine function m = G(-,x, z), 
and a point xq E S mt where S := {x E Q \ u(x) < m(x)}. Again, since m is nice, by (Unif) and 
(DomConv) we have that [U] i>2 is well-defined and convex. We also assume that diam(S') < e 
for some very nice constant e > 0, to be determined, and there exists some ball B C T£M such 
that [S 1 ]^ CBC 3 B C [kl] x ,z (this ball B does not have to be of uniform size). 
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Lemma 5.5. Given w G§" C T|Af. there exists x u G dcKff Q s (x 0 ) and a ven/ nice constant 
C > 0 such that 

I E(xq, X, z)uj^ Pxo,u(xo)(^u) ~ Pxq,u(xq)(x) | < 1 (5 1) 

\E[xq, x, z')io'^\g xo \Pxq,u{xq) (x U] ') Pxq,u(xo) Od 2 \/n 

C '|Pxo,u(*o)(®«) -P*o,«(*o)(®)ls-o - - “1^ \ • ( 5 ' 2 ) 

d^(x 0 ),n [5]s J 

Proof. Fix an w 6 §" C T|M, and take any x w G such that Px,z(x u ) G [S 1 ]^ H If^ . We 
intend to follow the proof in [GK15, Lemma 4.7], but since u may not be semi-convex we must 
find an appropriate alternative to [Roc70, Corollary 23.7.1] which was utilized there. First we 
define U(p) := it(exp® 2 (p)) — m(exp® 2 (p)), it is clear that this function is G-convex on [fl ]^ i2 
where 

G(p, x\ z') := G(exp!? 2 (p), x', z') - m(exp!? 2 (p)), 

with g := {(p, x',z') | (exp® z (p), x', z') G g}. G has the same C 2 regularity as G , and satisfies 
the same set of conditions, including (G-QQConv) and (G*-QQConv). Thus as in the proof 
of Corollary 4.24, we can show that dU(p ) = {lim^o a DU(pk) \pk~^p}- Now by [Cla90, 
Theorem 2.5.1], this implies that dU(p ) = d c U(p ), where d c U is the Clarke or generalized 
subdifferential of U (see [Cla90, Chapter 2.1]). Since [<S , ]i, 2 = {p G [f2 cl ] S)2 | U(p) < 0}, by 
combining [Cla90, Theorem 2.4.7, Corollary 1] and [Cla90, Proposition 2.4.4] we find there 
exists a t* > 0 such that t*u G dU(px, z {xu)) (again, identifying T*_ _ = TfM = T%M ). 

Thus we can continue as in the proof of [GK15, Lemma 4.7], to find that if 

x u (t) : = exp® im ( !Bu )(Px u ,m(x«)&) +tE(x 0J ,x,z)u i ) 

= ex Px Wl «(* w )(P*u,,«(x w )(®) + tE(x u ,X,z)u t ), 

then ^(f*) G dcu(x UJ ) (we have used here that m{x u ) = u(x U] ) since S 9 C fl int ). 


(5.3) 
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Then writing z u (t) := H(x u ,Xu(t),u(xu)), we see that G(xo, x w (t*), z u (t*)) < u(x o) while 
G(-, Xa,(0), ,^(0)) = m, hence there exists some value 0 < t** < t* for which 

G(x 0 ,x OJ (t**),z u (t**)) =u(x 0 ); (5.4) 

let us write 

Xu : = Xu(t**), z u := Zu{t**). 

Since m is nice, we may take := jy G S | sup ig [ 0 t .] G(y,x u (t), Zu(t)) < m(y) | and x u (t), 
z u {t) in place of x{t), z(t) in the proof of Proposition 4.23 to see that G(-, x u , z u ) < m(-) on 
S, or in other words (recalling (4.5)) x u G 8gK xq s (xo) as desired. 

Now recalling that [S 1 ]^ C B for some ball, it is not hard to see that writing 

P0 -=Px,z( x o), 

the orthogonal projection of po onto lTj^_ is contained in 3 B C [D] S)2 , and hence the whole 
line segment in between (by (DomConv) and since m is nice). At the same time, Bg '■= 
[{G(-,x u ,z u ) < m(-)}] S)2 is convex by Proposition 4.28, and by differentiating G{-,x u ,Zu) it 
can be seen that u is an outer unit normal to Bg at Px,z(xu)- Thus, there exists p\ in the 
intersection of Bq with the ray {pq + suj \ s > 0} with 

x\ ■■= exp^(pi) G n, 

and 

\pi ~ Po\gx < d(po, ]a J, (5.5) 

(see Figure 5). Now let us write 

x(s) := [x 0 , ®i] s> * , x(t) := [x, x u ] Xo>tl(a!o) , 
z(t) := H(x 0 ,x(t),u(x 0 )) 

Pui ■ Pxo,u(xo) (Xu ) > P0 ■ Pxo,u(xq)(x) , 

note that z( 1) = z u by (5.4). Additionally, since u and m are nice, by (Unif), (DomConv), and 
(DomConv*), x(s), x(t) are well-defined and remain in fi cl , D cl respectively. Now if we write 

Pu ■ Pxq,u(xq) {.Xu ), P0 ■ Pxo,u(xo) ( x )j 

by (4.12) combined with (5.5), for some very nice C > 0 we arrive at the inequality 
G(xl,x u ,z u ) - G(xi,x,z{0)) < C\pu -po\gx\Pl ~Po\g XQ 

< C\pu - Po\g x d(po, n ^]- 2 ) ■ (5-6) 

On the other hand recalling the choice of x\, and since u and m are nice, 

G(x\,Xu,Zu) - G(xi,x,z(0)) > Ci(m(xo) - u(x 0 )) + G(xi, x u , z^) - m(x i) 

= Ci(m(xo) - u(x 0 )) 

for some very nice C\ > 0 by applying the mean value property; combining with (5.6) we thus 
arrive at (5.2). 

Finally, define 


v := E(x o, x, z)oj 


w : = (t**) 1 ( PaJ -p 0 ) . 
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Thanks to Lemma 4.30, we know that Px 0 ,u(xo)( ex Px u «(£„)(')) a Lipschitz map on T*^M with 
a very nice Lipschitz constant. Then we find for some very nice C > 0, 

I Pu ~ Po\g XQ 

= |P*o,«(*o)( eX Px w> tt(*„ ) (P*u,,t*(x W )(*) +t**E( Xoj ,X, z )J)) - Pxortxo^eXPx*, ,«(*„)(P*u,,«(*<-)(*)))lfl*0 

< C\Px„,u{x„)(x) + t**E(x u ,x,z)(J - P Xu ,u(x„)(x)\g Xu , 

= Ct**\E( Xuj ,x,z)J\g Xui 

< Ct** 

(here we have also used (G-Nondeg) and the fact that u has unit length). As a result, 

\ w \g * 0 ^ C - 

Now since ui is unit length, we see 1 — < G for a very nice C > 1. Now we claim that if 

\ v \ax 0 

diam (S) is smaller than a certain very nice constant (recall also Lemma 4.30), we can ensure 

I V ~ W \g* 0 < lEVrr- 

To see this, consider the map 

F : Dom (F) C fi cl x Q cl x T*M -> T*M, 

defined by 

F(x 1 ,X,X 2 ,V) := Px ljU (xi) \ ex Px 2 ,u(x 2 )(Px 2 ,u(^)(^) +v )) ~Px l,u(xi)(*). V e T X 2 M - 

Note that 

F(xi, X 2 , 0) = 0, V xi, X 2 , (5.7) 

F(x 0 , x 0 , x 0 , V) = V, V V £ T Xq M. (5.8) 

Moreover F is differentiable with respect to the (. X 2 ,V ) variables along the fibers of T*M , the 
derivative being continuous in all four variables. This continuity depends only on the modulus of 
continuity of E(x, x, z) (and its inverse), and the modulus of continuity of u which is controlled 
by a very nice constant (see Remark 4.19). Thus all constants obtained below will also be very 
nice. From this point on, we will always take x\ = xo and x = xq in F, thus we will write 
simply F(x,V) for brevity. 

Now, the map F was constructed so that the point 

ex Pw(zud(Pw(^)(^) + t**E(x u ,x, z)J ) 

is the same as 

ex P*o,v(*o) (pxo,u(xo)(x) +F(x u ,t**E(x u ,x,z)u*)') . 

Thus, to obtain the desired bound all we have to do is show that 

(t**)- 1 [F{x u , t**E(x u , x, z)J) - t**E(x 0 , x, z)J ) 

has a small enough length. We distinguish two cases, first, let us assume that t** < to for some 
to > 0 to be determined below. Then, differentiating at V = 0 and recalling (5.7) leads to 

(t**)~ 1 F(x U] ,t**E(x UJ ,x,z)uj i ) - E(x 0 ,x, z)ujti 

= D v F(x UJ1 0)E(x U} ,x,z)uj li + o(l) - E(xo,x, z)cv^ 

= (D v F(xu,0)E(xu,x,z) - E(xq,x,z)) + o(l). 
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Where o(l) represents a vector whose length goes to zero as to —> 0. Then, to > 0 is chosen as 
a very nice constant such that 


|(t**) i F(x 0J ,t**E(x UJ ,x,z)u; i ) - E(x 0 , x, z)u% Xu 
< | (D v F(x U] ,0)E(x U] , x, z) - E(x 0 , x, z)) io% Xuj + ^ 4g 1 ^_ . 

On the other hand, since F is continuously differentiable in V we have that 

x i y DyF(x, 0 )E(x, x, z) — E(x o, x, z) 

defines a linear transformation-valued continuous map with respect to x. Furthermore, a stan¬ 
dard computation shows that this map is zero for x = xq. It follows there exists a very nice 
constant do such that if o) < do then 


V ~ W \9*0 


1 

ACy/n' 


This takes care of the case t** < to- Now, suppose that t** > to- Let us choose d small enough 
so that 


\F(x 0J ,t**E(x U] ,x,z)u;' li ) - t**E(x 0 , x, z)w tt | ffa , 0 < 


to 


ACy/n 

which is possible thanks to (5.8) and the very nice control on the modulus of continuity of F. 
Having the desired bound on \v — w\ gxo , we can calculate 


w 


< 


1 


Mff* 0 M<?*o 0 \ v 


< 


\9x 0 

1 


\V-w\g x o +F- 


'9* 0 


w 


w 


9x o 


9 x o 


\v — w 


\9x 0 


9x o + \ \ v \gx 0 l^lg^ol) < 


2 y/n 


and we arrive at (5.1) as desired. 


□ 


Next we apply the above lemma to a specific basis of n directions: let oj 1 E TAM be the unit 
vector of interest in Theorem 2.1 and {w ? -}” =2 be an orthonormal collection in TAM aligned 
with the axial directions of the John ellipsoid of [S]^ in such a way that §& ^ for 

every 2 < i < n. 

Lemma 5.6. For the above choice of {cUj}” =1 , there exists a very nice constant C > 0 such 
that 


c \ d GKx 0 ,s( x o)\ c > (m(x 0 ) - u{x 0 )) n Y[— -— 

i= 1 d^ z (xo),n^ kz 
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Figure 6. 


Proof. Let us write X{ := x u . which are obtained by applying Lemma 5.5 to the directions 
Pi '■= Px 0 ,u{x 0 )(xi) for 1 < i < n, and also p := p Xo>u ( Xo )(x). Then we have p, pi G 
i d C K x 0 ,S (^o)]^,„(*<))> hence by Remark 4.26 


conv {p, pi | 1 < i < n} C [d G K^ oS (ao)]^ )U ( so) . 


Now (5.1) combined with (G-Nondeg) implies that the directions {pi — p }\Li span a paral¬ 
lelepiped whose volume is comparable by a very nice constant to that of conv j | pi — p\ gx ^ \ 1 < i < 
which in turn (due to our assumption on the angles between u: i and WjJ has volume comparable 

n 

to f([ |pi — p\g xo by a constant depending only on n. Combining this with (5.2) and recalling 
i=l 

(4.10) from Remark 4.30, we obtain the claimed inequality. □ 


Just as in the proof of [GK15, Lemma 4.8], and using Corollary 4.31 (the estimate (4.10) in 
particular), we can obtain the following bound. 

Lemma 5.7. There exists a very nice constant such that 


n 

i =2 

It is now straightforward to combine the two last lemmas to obtain the analogue of the 
Aleksandrov estimate. 
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Proof of Theorem 2.1. Multiplying the inequalities from Lemmas 5.6 and 5.7 yields, 

K[s\x,z^i) 


c | 9gKx 0 ,s (®o)| £ \S\ c > (m(x 0 ) - u{x 0 )) r 


> (m(x 0 ) - u(x 0 )) r 


n (j[\ T^i FT Uli 


d^Pxjzfa o)> 

l([S]x, Z,^l) 


H d(px,z(x 0 ), n l 


[S] gjJ 


d(px,z{% o), 2 


Rearranging and applying Lemma 4.27, the theorem follows. 


□ 


6. The sharp growth estimate 

In this section we will work toward proving the estimate Theorem 2.2. The strategy of our 
proof will essentially follow [GK15, Section 3], however we must redo [GK15, Lemmas 3.8 and 
3.10] using our conditions (G-QQConv) and (G*-QQConv). Throughout this section, let us 
fix a G-convex function u, m, and A C S as in the hypotheses of Theorem 2.2: namely that 
u is very nice , u N < m < un on fl cl , KM[A]x lZ C [S 1 ]^ for some very nice K > 0, and 
sup 4 m + sup4 (m — u) <u (we also remind the reader that M is the constant in (G-QQConv) 
and (G*-QQConv) associated to the choice [uq,uq] = [u n ,?Zn])- 

This first lemma replaces [GK15, Lemmas 3.8]; but contains a crucial difference. The under¬ 
lying idea here is that we would like to control \d G u {Af)\^ from above by the G-subdifferential 
of some G-cone at one point (which is much better behaved). This amounts to showing that 
G-affine functions supporting to u also can be vertically shifted to support to a G-cone; as in the 
Euclidean case, one cannot take the whole section S as the base of this G-cone, a smaller dilate 
is taken to make sure the G-cone is “steep enough.” However, in order to show the inclusion we 
must rely on (G-QQConv), thus we essentially must consider a G-cone whose vertex lies on m 
instead of below it in S. Since the dependence of G on the scalar parameter is nonlinear, this 
is no longer a vertical translation of the usual G-cone, thus we must instead consider a related 
G-dual set (compare Definitions 4.22 and 4.25). We also comment here that we do not require 
condition (2.2) in the following proof. 

Lemma 6.1. There is a choice of very nice K > 0 for which 

d G u {A) c A ( l m m Xaup 

where A SU p := sup A (m — u ) and Px,z{x cm ) is the center of mass of [S^z- 

Proof. Again we comment that by the assumptions on m combined with (Unif), we have 
(x, x,z) £ 0 for every x G D; in particular [S^.z is well-defined. 

Fix some x G A and x G d G u(x), and let m(-) := G(-, x, H(x, x, u(x)))', thus m is nice and 
supporting to u from below at x, and u N < m < u n on H cl . Also let m(-) := G(-, x, H (x cm , x, m(x cm 
and let x max be the point in A cl where the difference m — m is maximized. In order to show 
that x G AP m \ , our goal is to show that m(x mSLX ) — m(x max ) < A SUD . 

sup \ / \ / 

We note here by using the mean value theorem in the scalar parameter, 

m(x) - fh(x) = G(x,x,H(x cm ,x,m(x cm ))) - G(x, B, H(x cm , S, rh(x cm ))) 

= G z (x,x,H(x crn ,x,me))H u (x crn ,x,me)(m(x cm ) - fh(x cm )) (6.1) 

where mg := (1 — 6)m(x crn ) + 0fh{x cm ) for some 6 G [0,1]. By our assumptions mg G [u N , wn]j 
in particular the product G Z H U in (6.1) has strictly positive, very nice upper and lower bounds, 
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which we write C SU p and Gi n f. Thus we arrive at the inequalities 

Cinf(m(x cm ) - fh{x cm )) + m{x) < m(x), ( 6 . 2 ) 

C SVLp {m(x cm ) - fh{x cm )) + m{x) > m{x) (6.3) 

for any x. 

Next we can see there exist points x 9 , x^ ax £ S 9 so that x and x max he on [x cm ,x 9 ]- , and 
[zcm, ZmaxjjE,* respectively (let us write x(s) := [x cmi x 9 ]- z and x max (s) := [x cm , x^ ax ] _J. 

Moreover, since KM[A]x jZ C there exist 0 < s, s max < jJjg- for which x(s) = x and 

®ma X (sma X ) = ®ma x - By the boundedness assumptions on m and (DomConv) both of these 
G-segments are well-defined, and by Proposition 4.28 lie entirely in [S']x. z . Additionally, the 
boundedness assumptions on m allow us to apply (G-QQConv) along both of these G-segments 
as below. 

By (G-QQConv) along x max (s), we obtain 

m{x max ) fii ( x ),"i ax ) T Afs max [7?i(x max ) rn(x m . lx )] 

< K-^fhix^) - m(x^ ax )]+. (6.4) 


At this point let us take 

r , 2G sup 

K:= ~cw' 

which again is very nice ; we then consider a number of cases. 

Case 1: If m(x^ ax ) < m(x^ ax ), then the above inequality already implies m(x max ) — 
mix m a x ) < 0 < A sup and we are finished. 

Case 2: Otherwise we can take x = x^ ax in (6.3) and combine with (6.4) to obtain 


mix max) rn(x max ) 5^ I\ [Csup{m{x cm ) m{x cm )) + m(x max ) r?i(a; max )] 
< K~ 1 C sup (m(x cm ) - fh(x cm )) 

= r-{m[xcm) ~ m{x cm )), 


(6.5) 


the second inequality is due to the fact that m(x ^ ax ) < u(x ax ) < m{x 9 n . AX ) since x^ ax G S. 
At this point we can apply (G-QQConv) along x(s) to obtain as above, 


m(x) — m(x) < Ms[m{x 9 ) — m{x 9 )\ + 

< A' _1 [m(x 9 ) — m(x 5 )]+. 


Combining this time with x = x in (6.2), we see that 

K~ l [rh{x 9 ) — m(x 9 )} + 

> G in f(m(x cm ) - m{x cm )) + fh(x) - m(x) 
= G in f(m(x cm ) - rh{x cm )) - {mix) - u{x)) 
^ G; n f {m{x cm ) m{x cm )) A sup . 


( 6 . 6 ) 


Case 2a: If m{x 9 ) < m(x 9 ), by rearranging the above we see m{x cm ) — rh{x cm ) < G jnf A sup 
which combined with (6.5) yields 

m{x max ) m{x max ) < - G- n £ A SU p < A sup 


2 
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as desired. 

Case 2 b: Otherwise in the final case, we once again apply (6.3) with x = x 9 and combine 
with (6.6) to obtain (using that x 9 E S as well), 


Cinf(TTj(iEcm) ^(®cm)) — A S up T K Cg up ( Tit(X cm ) m(x cm )) 

— Asup T ~ (rn(x cm ) m(x cm )) 


or rearranging, 


™ f (m(x cm ) fh(x cm ))< A sup 

Clearly combining this bound with (6.5) gives m(x max ) — m(x max ) < A sup , finishing the proof. 

□ 


With the above Lemma 6.1 and Lemma 4.32 in hand we can connect the G-dual set with the 
usual polar dual from convex geometry (defined below), in the appropriate coordinates defined 
via (G-Twist); this easily leads to our claimed estimate in Theorem 2.2. 

Definition 6.2. Let V be an linear space, A C V, po E A mt , ^ £ L*, and A > 0. The polar 
dual of A of scale A, center po, and base qo, denoted A* q qo A C V*, is the set given by 

A * Po ,qoA := {<? e v * I (<1 ~ Qo,P~ Po) < A, \/peA}. 

Lemma 6.3. There exists a very nice C > 0 such that 

A Z m , m x»,\ c <c\A\pK UI ,. 

Proof. Fix y E ^x cm m x sup an d x £ A] recall that m(-) = G(-,x,z). Note by (Unif) and 
(G-Nondeg), we can see that E^ 1 (x cm ,x, z) is well-defined. We claim that 

(Px,z( X ) - Pcm,E~ l {x cm , X, z)p Xcrnt m{x cm )(y) ~ Q) < Cl Asup (6.7) 

for some very nice C\ > 0 , where 


Pcm • — Px,z\%cm j? 

q : = E- 1 (x cm ,x,z)p XcrnMxcrn) (x). 

First hx an let x(t) := [x, y] Xcm m ( Xcrn ) an d write z(t) := H(x crn ,x(t),m(x cm ))', since 

iri(x cm ) E [u n ,un ] 5 by (DomConv*) and (Unif) we see x(t) is well-defined and remains in f2 cl . 
Now, we can assume 

(Px,z( x ) ~ Pcm, E ^ 1 (x cm , x, z)p Xcm!m ( Xcm) (y) - q) > 0, 
otherwise (6.7) is immediate. First, it is clear that x E Af) m > so the second claim in 

v ' 4 C m)''b^sup 

Proposition 4.23 implies that [x, y] Xcm m (x cm ) C A x cm m A sup ( reca b by our assumption (2.2), we 

have sup m + A sup < u). Combining with (2.2), for any t E [0,1] we must have 
A 

G(x, x(t), z{t)) < m{x) + A sup < u. 

Next let [0, to] C [0,1] be the maximal subinterval (to necessarily strictly positive) on which 
G(x, x(t), z(t)) > u N . By (Unif), we can apply (G*-QQConv) along x(t) on [0,to] (after 
reparametrizing) to see that G(x, x(t), z(t)) cannot have any strict local maxima in (0, to)- 
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The calculation in Lemma 4.32 shows ^G(x, x(t), z(t)) | f=Q > 0 by our assumption, thus we 
must actually have to = 1. As a result 

Mn — G(x,x(t),z(t)) < u, 


or by (Unit), (x,x(t), z(t)) G for all t G [0,1]. We can thus apply (4.11) from Lemma 4.32 to 
obtain for a very nice Co > 0 , 

(Px,z( x ) - Pcm, E~ 1 (x cm , x, z)p Xcm ^ Xcm ){y) - q) 

< C 0 M [G(x,z(l),z(l)) - G(x,z(0),,z(0))] + 

= C 0 M [G(x,y, H(x cm , y,m(x cm ))) - m{x)\ 

< Cc)MA SU p, 


and we obtain (6.7) with C\ := GqM. As a result we see this implies that 


E ( x cm, X, z)[A Xc m t m t \ Bup ]x C rn,m(x C m) E- ([AU,z) 


Pcm ,Q jCl A S up 5 


thus by taking the volume of both sides (recall also Lemma 4.30 and Corollary 4.31) and 
combining with [GK15, Lemma 3.9] (note we do not need [A] S)Z to be convex, as we can apply 
the result to the convex hull of [A] SjZ to obtain the same inequality, using that the polar dual 
of a set is unchanged by taking its convex hull) we obtain the lemma for another choice of very 
nice C > 0. □ 


Proof of Theorem 2.2. Combining the above Lemmas 6.1 and 6.3, the theorem is immediate. 

□ 


7. Localization and strict G-convexity 

In this section and the next one we shall use the estimates from Theorems 2.1 and 2.2 to 
prove the strict G-convexity of a G-convex solution to a G-Monge-Ampere equation with a 
nondegenerate G-Monge-Ampere measure (recall Definition 4.16). 

It is assumed that the support of the G-Monge-Ampere measure (denoted Dg 1 ) lies in the 
interior of D. Moreover, it is assumed that Do := &G U (Do) is such that Dg 1 C D mt and is 
G-convex with respect to (x, u(x)) for all i G Do. 

Remark. One expects the strict G-convexity to also hold in a situation where D and D are 
strictly G-convex, as oppose to assuming that the closure of Do is contained in the interior of 
0. This is for instance what is done in the work of Figalli, Kim, and McCann [FKM13a] in the 
case of optimal transport. 

Moreover, we will assume for the rest of the section that u is a very nice G-convex function. 
Recall that this assumption is needed, even if the data is smooth (as discussed in Section 3.1). 
Thus, for the rest of Section 7 we will fix 

u : D — > M, very nice, and solving (GJE) in the sense of Definition 4.16 for some A > 0. 

(7.1) 

The first of our theorems in this section says that “singularities” (in the sense of failure of 
strict G-convexity), if they happen at all, must propagate all the way to the boundary of D. 

Theorem 7.1. Let u be as in (7.1). If xq G Do and zo are such that mo(-) = G(-,xo,zq) is 
supporting to u at some xq G Dg nt , then the set 

So := {u = mo] 

is a single point, or else every extremal point of [So]x 0 ,z 0 is contained on the boundary of [D] Sq>20 . 
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Using this result we will prove Theorem 2.3 later in the section. 

7.1. Some elementary tools. Let us review some notions from convex geometry (see for 
example, [Roc70]) and linear algebra. 

Definition 7.2. Suppose that A is a convex subset of T|M and p e G A d . Then, the strict 
normal cone of A at p e and normal cone of A at p e are defined by 

N p e M) : = {qe TfM \ g io (q,p- p e ) <0, Mp / p e G A}, 

N Pe (*4) : = {g£ TfM | gx 0 ( q,p - p e ) < 0, Vp G A}. 

If Np e (A) is nonempty, p e is called an exposed point of A. 

Remark 7.3. It is well known, N Pe (A) and AT (A) are convex cones. Also N Pe (A) is closed, 
and contains 0 and at least one nonzero vector for any p e G A®. 

7.2. Tilting and chopping. The proof of Theorem 7.1 goes by a contradiction. If So has 
more than one point and also contains an interior exposed point (when seen in cotangent 
coordinates), then one may find sections St := {u < rrit } (t small and positive) with a geometry 
that contradicts the combined estimates from Theorem 2.1 and Theorem 2.2. The sections St 
will be obtained by adequately “chopping” the original contact set So with a family of G-affine 
functions rrit which are obtained by “tilting” the original function m o- 

The next two lemmas deal with the selection of the family of G-affine functions rrit■ We do 
not yet need the fact that u is an Aleksandrov solution here, just the fact that it is very nice. 

Lemma 7.4. Let mo(j := G(-,cc 0 , 20 ) be a G-affine function supporting to u somewhere in 12 
with xo G f2oi and define 

S 0 : = {u = mo} ■ 

Also, suppose p e is an exposed point of [So]x 0 , 2 0 , that e 0 G N pe ([So]s 0)2o ) ' s unit length, and 
So contains at least two points. Then for any fixed 5 > 0 there exists a family of nice G-affine 
functions Q , (depending on So and e 0 ), such that for all small enough t > 0 we have 

mo(x e ) = u{x e ) < m s t (x e ), (7.2) 

[‘S(5,t]so,zo U R>5 (Pe); (7-3) 

u N <mt(x ) < %[, V x G 12. (7.4) 

where S$ t t ■= {u < mf}. 

Proof. Let us write 

x e := exp s 0 , Z0 (Pe), Po ■= P Xe ,u(x e )(x o), 
and note that since mo is supporting to u at x e we have 

z 0 = H(x e ,x 0 ,u(x e )). 

We will now define mf. Note that by (G-Nondeg), we have E(x e ,xo,zo)eo^ / 0 (see Definition 
5.2 for the definition of e 0 ^). Since fig 1 C 12 mt , for t > 0 sufficiently small, po + t-E(x e , xq, zo)e 0 ^ 
remains in [l2 ] Xf ,, u (xe)i hence 

x{t) := exp® |1l(Xe) (po + tE(x e , x 0 , z 0 )e 0 tt ) 
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is a well-defined G-segment for such t (we comment here that the smallness of t does not need 
to be very nice, in fact it is allowed to depend on x e , So, and e 0 , and we may have need to take 
it smaller later in this proof). Also define 

z(t) : = H{x e ,x(t),u(x e )) = Z^(p 0 + tE(x e ,x 0 , z 0 )e 0 i ,u(x e )), 

z s (t) : = H(x e ,x(t),u(x e ) + Tst) = Z^ e (po + tE(x e ,x 0 , z 0 )e 0 i ,u(x e ) + T S t), 

for some small ts > 0 to be chosen later. Now we consider the G-affine functions 

m 5 t (x) := G(x,x(t),z s (t)), 

note (7.2) follows immediately. First, a simple compactness argument yields 

C ([Sc)]so,zo) C^-5) 

for some r(t) = o(l) as t —> 0 (with ts fixed), while again a compactness argument along with 
the inclusion e 0 6 Np e ([«So]:zo,zo) gives existence of a t$ > 0 such that 

[<So]* 0l ao n {p G T| 0 M | gx 0 {p - p e , e 0 ) > -t 5 } C B 5/2 (p e ). (7.6) 

Next since u is very nice, we see that if ts is sufficiently small, then [u{x e ), u(x e )+Tst] C [u N , un]- 
Thus by using the mean value property as in the calculation of (6.1) there exists a very nice 
Gi > 0 such that 

I G{x, x(t), z s (t)) - G(x, x(t), z(t))\ < CiTst , (7.7) 

and in turn if mo(x) < mf(x) we have 

0 < G(x, x(t), zs{t)) - G(x, x 0 , z 0 ) 

< C\Tst + G(x,x(t),z{t)) - G(x,xo,zo). 

At the same time, since x(s) remains entirely in 0, by (DomConv*) and u is very nice, the 
quantity — G z (x, x(s), ^(s)) is bounded away from zero and infinity by a very nice constant. 
Thus for some very nice C > 0, (also using (4.2), (4.3)) 

—C\Ts < t -1 [ —G(x, x(s ), z(s))ds 

Jo ds 

rt jpQ jjQ 

= t _1 / (-G z (x,x(s),z(s))){~ — (x,x(s),z(s)) + — (x e ,x(s),z(s)),x(s))ds 

Jo ^Z ^z 

< Ct~ l [ {Px(s),z{s){ x ) - Px(s),z(s)(xe),E- 1 (x e ,x(s),z(s))E(x e ,xo,zo)e 0 i )ds 

Jo 

= C{p^ s , )tZ ^ ) (x) - p s ^ ) ^ ) (x e ),E~ 1 (x e ,x(s , ),z(s'))E(x e ,xo,zo)e 0 i ), (7.8) 

for some s' G [0 ,t]. We pause to note here, s' is determined by t, thus this last expression can 
be viewed as a family of functions in the variable x G fl cl , parametrized by t > 0. By the G 2 
assumption we have on G, as t approaches 0 the expression converges uniformly in x G to 
the quantity 

{Px 0 ,z 0 (x ) - p So ,z 0 {x e ), E~ 1 (x e ,xo,zo)E(x e ,x 0 , z 0 )e 0 i ) = px 0 ( Px 0 ,z 0 (x ) -p e ,e 0 ). 

As a result, first taking ts small, we have for all t > 0 small, the inclusion 

[{m 0 < rn t s}] 5 io , zo C jp G T^M \ g So {p - p e ,e 0 ) > -y | . 
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Since mo is supporting to u we have 

S s ,t C {m 0 < m^} , 

combining with (7.5) and (7.6) we obtain (7.3). 

Finally, this last argument of uniform convergence shows that if t is taken sufficiently small, 

sup|G(y, x(t), z(t)) — mo(y)\ is small, (7-9) 

yen 

hence combined with (7.7) and the fact that u is very nice , we can ensure (7.4) holds. □ 
In the next lemma the notation is again used (see Definition 5.1). 

Lemma 7.5. Let mo, So, p e , and x e be as in Lemma 7.4 above, and suppose x e G O mt . Then, 
for each 5 > 0, we can find (each of the following depending on So), some to > 0, a family of 
G-afhne functions mt(-) := G(-,xt, zt), a family of unit length e t G Tf t M defined for t G [0, to], 
and some eo > 0 which satisfy the following for all t G [0,to]: 


m t (x e ) > u(x e ), 


nun 


m t (x e ) ~ u(x e ) 

sup (mt — u) 
St 


m t (x e ) - u(x e 


sup (mt — u) + u(x) — mt(x) 
St 


lim 

4-5-0+ 


d(pxt,zt ( x e)i n 


t un * 

[St]x t ,z t [St]x t ,z t/ 


lim m t (x e ) =u(x e ), 
t\0 

(7.10) 

, C B S (p e ), 

(7.11) 

) < %f, ViGfl, 

(7.12) 

> e 0 , Vx G S 0 \ [J S t 

(7.13) 

te(o,4 0 ] 


= 0,. 

(7.14) 

a unit length e 0 G N° e ([S , 0 ]x 0 , Zo ) and 


K[St\x t ,zti e t) 

Here we have written St := {u < mt}. 

Proof. Fix 5 > 0. By [GK15, Lemma 7.4], there exists a unit leng 

Ao > 0 such that p e — Ae 0 G [So]s 0iZO for all A G [0, Ao]. Let mt ■= mf be obtained by applying 
Lemma 7.4 with this choice of e 0 and 6, and we may assume both that to < Ao and to is small 
enough to obtain all the properties detailed in Lemma 7.4 when t < to- Also let xt := x(t) and 
Zt ■= zg(t) as defined in the proof of Lemma 7.4 above. Then (7.2) immediately implies (7.10), 
(7.3) implies (7.11), (7.4) implies (7.12), and each mt is nice. 

Now we will show (7.13). First, by (7.7) and a calculation similar to (7.8), Cauchy-Schwarz, 
(G-Nondeg), and Lemma 4.30 we find a very nice C > 0 for which 


sup (mt - mo) = sup [G(y, x t , z t ) - G(y, x 0 , z 0 )] 
si yen 


< Ct( 1 + Tfi). 


Recalling that mo < u, we obtain 

m t (x e ) - u(x e ) m t (x e ) - u(x e ) 

sup (mt — u) ~ sup (mt — mo) 
St St 

> Tst 
Ct( 1 + Tfj) 

> n . 

C( 1 + Ts) 
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Next note that since x 0 Ute(o t 0 ] St the denominator of the second expression in the minimum 
in (7.13) is always strictly positive. Then since a; £ So we have 

sup (nit — u) + u(x) — mt(x) < sup(m t — mo) + mo(x) — mt{x), 

St S t 

by an argument much as above we obtain (7.13) for the choice 

n 

60 ~ 2(7(1 + t s ) ' 

We now work toward showing (7.14), to this end take any x cp £ So- Recalling (7.9), we can 
apply (4.11) in Lemma 4.32 and use the mean value theorem as in (7.7) to find a very nice 
C > 0 such that 


m t (x cp ) - u(x cp ) = mt(xcp) - m 0 (x cp ) 

= G(x cp , x(t),zs(t)) - G(x cp , xq, zq) 

> G(x cp , x(t),z(t)) - G(x cp , xq, zq) + Crst 

(j 

> Jj(Px 0 ,z 0 (x C p) - Pe, E~ 1 {x e ,xo, zo){p Xe ^ Xe] {x{t)) - p 0 )) + Cr s t 

C u 

= tjj{ p x 0 ,z 0 {x C p) — Pei £q) + Crst 

c 

= t-j^jpxo ( Px 0 ,z 0 (Xcp) — Pei^o) 4" Grst. 

Since So contains at least one point besides x e and [<S , o]x 0 ,z 0 convex by Proposition 4.28 (recall 
mo is assumed nice), we may choose x cp € So, x cp ^ Xe in such a way that the final expression 
in the above calculation is always nonnegative. In particular 


Xcp £ St, t £ [0, to]’ 


(7.15) 


Finally, we define 


e 


t 


Px t ,zt(eW < i a ,z 0 (Pe + loe 0 )) - Px t ,zt(x e ) 
\Pxt,z t (eW ( g 0 ,z 0 (Pe + l 0 e 0 )) - Px t ,zt(Xe)\g St 


for some sufficiently small lo > 0 such that the above expression is defined. Suppose by con¬ 
tradiction that (7.14) fails, then there exists e > 0 and a sequence of tk > 0 going to zero such 
that 


e < 


a([Stk ,Zki e k> 


V k 


(7.16) 


where for ease of notation, we write Sk := S tk , Xk := x t> ., Zk := z th , p k e := Px tk ,z tk {x e ), and 
e k := e t . By compactness, we can assume all of these sequences converge, it is clear that 
Xk ~> xq, Zk —*■ zq, and e fc -> e 0 and p k —> p e (in T*M). Now we can see that 


Jim l{[Sk]x k ,z k ,e k ) = K[So}x 0 ,z 0 ,e 0 ) > t 0 > 0 (7.17) 

K—¥ OO 


by our choice of e 0 . Now recalling Remark 5.3, we obtain the existence of a sequence p k £ 
[ S k \x k ,z k such that for all k 
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by compactness of H cl we may assume that exp < i k:Zk {pk) converges to some Xoo G H cl as k oo; 
we easily see Xoo G Sq- Then combining with (7.17), rearranging (7.16), and passing to the 
limit, we would obtain 

0 < et 0 < gx 0 (Px 0 ,zo(xoo) ~ Pe, e 0 ). 

However, as e 0 G N® e ([<So]e 0 ,zo) this implies Px 0 ,z 0 (x oo) = Pe immediately giving a contradiction. 
Thus we have shown (7.14) finishing the proof. 

□ 

7.3. Proof of Theorem 7.1. From this point on, the rest of the proof is analogous to the 
argument in [GK15], specifically the proofs of [GK15, Theorem 5.7], and [GK15, Lemmas 5.8 
and 5.9], using Lemma 7.5 in place of [GK15, Lemma 5.5]. 

Some points of note. The sets spt p and spt p, and d c u from [GK15] should be replaced by 
Ho, Ho, and 8qu respectively, while Theorem 2.1 and Theorem 2.2 should take the places of 
[GK15, Theorem 4.1, Lemma 3.7]. By (7.12) and (7.11) (choosing a small enough 5 > 0), we 
can apply Theorem 2.1 to the sections St when t is sufficiently small from Lemma 7.5. Also we 
see that by (7.13) and (7.12) we will have 

supmt + sup (mt — u) < ?z N + Cq 1 (mt(x e ) — u(x e )) 

S t S t 

thus by the second part of (7.10), for t > 0 small enough we obtain (2.2); hence we can also 
apply Theorem 2.2. 

Finally, the set S '] 318 appearing in the proof of [GK15, Lemma 5.9] should be redefined as 

S) bls := {x G H | u{x) < G(x,xt,H(x 0 ,xt,u(x 0 )))}, 

where m*(-) = G(-,xt,zt) and for some choice of xq 0 St- Here, we note that as in (7.7), we 
have 


| G(x,x t ,H(x 0 ,xt,u{x 0 ))) - m t (x)\ < C(u(x 0 ) - m t {x 0 )) 

thus combining with (7.12) and choosing xo close enough to the boundary of St, we can ensure 
n N < G(x, xt, H(xq, xt, u(xq)) < Fn and (2.2), for all x G H and t > 0 small. With this choice 
of xo, we are able to apply Theorem 2.2 to G(x,xt,H(xo,xt,u(xo)), and the proof of [GK15, 
Lemma 5.9] can now be followed. 

□ 

7.4. Strict convexity. For the remainder of this section we fix xq G Hq 1 *, xq G dcu(x o), and 
also write 


z 0 := H(x 0 ,xo,u(x 0 )), m 0 (-) := G{-,xq,z q ), 

P0 ■ Pxo,zo (^o) i P0 ■ Pxq,u(xo){xq) Pxo,?ntj(xo) (*^o) • 

Additionally, in this section we will be using the Riemannian inner product g XQ (•, •) on T* Q M. 

Lemma 7.6. Suppose that the conditions of Theorem 2.3 hold and Sq contains more than one 
point. Then there is some nonzero % G T* Q M such that 

( B Ap o) \ Br(p o)) n Ip 0 (qo,r ) C [Hq]^^) 


(7.18) 


Pointwise estimates and regularity in geometric optics and other generated Jacobian equations 


43 


for all sufficiently small and positive r. Here, Ip 0 (qo,r) denotes the cone 


Ipo (.QOi r ) 


P e [H] 


XO,u(xo) I ^9x 0 


^ P-Po , 



> \Kq±(p-Po)\ 



(7.19) 


and 7Tgj_ (p) is the projection of p onto the (n — l)-dimensional affine space containing po, which 
is g XQ -orthogonal to %■ Moreover, the linear function on [n] X0!Z0 defined by 


Kp) ■= (p, 


E l (xo,x 0 ,z 0 )q 0 , 

1901 g XQ 


attains a unique maximum on [So] xo 


(7.20) 


Proof. This proof is essentially identical to that of [GK15, Lemma 6.3]. □ 

Lemma 7.7. Suppose % G chosen as in Lemma 7.6 above, l{p) is defined by 

(7.20), and So contains more than one point. Then ifp max £ [So] xo ,z 0 is the unique point where 
l(-) attains its maximum over [So]x 0 ,^o; we have 

Pmax £ [So]s 0 , 2 0 n (7-21) 

Additionally we have the inequality 

inf l(p xotZ0 (x)) > l{p max ) - o(l), r-^0, (7.22) 

X 

where for each r > 0 small, the infimum is taken over the set of x 0 So satisfying 

[d G u(x)\ X0>u ( Xo ) n Ip o {q 0l r) n (B r (p 0 ) \ B r/2 (Po)) + 0- 

Proof. Since the maximum of a linear function on a convex set must be attained at at least one 
of its extremal points, p max must be an extremal point of [So] XQ . However, since So contains 
more than one point by assumption, Theorem 7.1 yields (7.21). 

We now work towards the inequality (7.22). Fix some ro > 0 to be determined, take r G 

(0,r 0 ), and let x 0 S 0 . Also suppose p r G [d G u (x)] X0jU ( X0 ) nI Po (q 0 ,r ) fl ( B r (p 0 ) \ Br(p 0 )j and 
define 

P ■ = Px 0 ,zo(x), x r := exp XoMxo) (p r ), J'max : = ex P s 0 , 2 o(P max) •> 

x(t.) : = [® 0 ,®r] X0jU ( X0 ), z(t) := H(x 0 ,x(t),u(x 0 )) = H(x 0 ,x(t),m 0 (x 0 )), 
m r (-) : = G(-,x r , H(x , x r , u(x))). 

Now since u is very nice , mo(-) £ [mnj^n] on H cl . Then by (DomConv) we can apply (4.12) to 
find some very nice constant C > 0 such that 

|G(x max , x(t), z(t)) - mo( J-max )| < Ct\p max -p\g* 0 \Pr ~Po\g* 0 < Cr 0 . 

In particular, if ro is sufficiently small we must have (x max , x(t), z(t)) G g for all t G [0,1], for 
any choice of p r and x. Next since m r is supporting to u, we must have for some very nice 
constant C > 0 that 


0 — ^(^mait) Wol^mas) 

A m r (x m aX ) ®o(^ma x ) 

— G(x max , X’ r , H(x o, X r , 77l r (xo))) W,o(Xma x ) 

> -C(mo(x 0 ) - m r (x 0 )) + G(x max , x r , H(x 0 , x r , u(x 0 ))) - G(x max ,x 0 , H(x 0 ,x 0 ,u(x 0 ))). 
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Here we have used the fact that u(x o) = mo(xo), and the very nice C arises once again from 
using the mean value theorem and the facts that both mo and m r lie in 

Note that since m r (x) = u(x) > mo(x) while m r (x o) < u(x o) = mo(xo) and H is assumed 
path-connected, there exists some x' Q G H such that 

m r ( x'q) = m 0 (x o). 

Thus by (DomConv) again we can apply (4.12), and calculate 

\m 0 (x 0 ) - m r (x 0 )| = \G(x 0 ,x r , H(x' 0 ,x r ,mo(x' 0 ))) - G(x 0 ,x 0 , H(x' 0 ,x 0 ,m 0 (x' 0 )))\ 

— C\Pxo,H(x' 0 ,xo,m 0 (x' 0 ))( x o) ~~ Pxo,H(x' 0 ,xo,mo(x' 0 )){ x o)\9x 0 \Px' 0 ,mo(x' 0 )( x r) ~ Px' 0 ,mo(a 
< C\pr -po\g XQ < Cr, 

where we have used the bound on mo, (4.9), (4.8), and boundedness of H to obtain the final 
inequality. We may then apply Lemma 4.32, (4.11) to conclude 


Cr > (p 

max Pi E 1 (x 0 , X 0 , Zq) (p r ~ Po)) 


_ / - - qo 

— VPmax Pi 9 xq \ Pr POi \ 

\ \qo\gx 0 


E 1 (x 0 ,x 0 ,z 0 )qo 


— 9xq I Pr POi | - 


qo 


\qo\gx 0 

l (Pmax -p)~ C\lT q ± (pr ~ Po)\g XQ 


+ E (xo,Xo,Zo)TT q ±(Pr - Po)) 


(7.23) 


\qo\g xo 

again for some very nice C > 0. We now prove that 

9x 0 (Pr ~ Po, qo) > 0. 

Indeed, g XQ (p r -po,qo) > 0 as p r G Ip 0 (qo,r), but g XQ (p r -po,qo) = 0 would imply p r = p 0 

, rearrange, 


which would contradict p r ^ B r / 2 (po)- Thus we may divide by g Xo ( p r — po —— 
and use that p r G Ip 0 (qo,r) to obtain 


1901 g X Q 


l(p)>l (pmax) - C 

> l (Pmax) - Cr, 


Kq±(Pr ~ P0)\g XQ 


9xo l Pr PO, |g 0 | 



- Cr 


proving (7.22). 


□ 


Corollary 7.8. Suppose that the conditions of Lemma 7.6 hold. Let qo G T* Q M and p max £ 
[Sol.xo. 2 o H [f2]^ ZQ satisfy the conclusions of Lemma 7.7, and let Ip 0 (qo, r) be as defined by (7.19). 
Then given any e > 0, there exists r € > 0 such that for any x G H cl \ So satisfying 


[d G u (x)] X0)U ( X0 ) n Ip 0 (qo,r e ) n {B Te (po) \ B^.(p 0 )j + 0, 


we must have 


\Pxo,zo ( x ) Pmax| g XQ < 


e. 


Proof. Let l(-) be defined by (7.20). The proof is by a compactness argument. Suppose by con¬ 
tradiction that the corollary fails, then for some eo > 0, there is a sequence of r^ > 0 decreasing 
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to 0 as k —^ oo, and sequences 
such that 


Pk £ \d G U ( x k))]xo,u(xo)i 

\Px 0 ,zo i x k) ~ Pmax\gx 0 > e 0 (7.24) 

for all k. It is clear that p k — > po as k —> oo, and by the compactness of [Q]|* o 2o a subsequence 
x k —> %oc for some Xoo G Q cl . Writing 

®max := exp^(Pmax), x k := exp x 0tU ( Xo) (Pk), 

since x max G So we calculate 

^JJ'O(^max) = ll(^max) ^ G(x max , Xk,H(x k ,x k ,u(x k ))) 

7 G(x max , Xo, ^/(Xoo, Xo, ^(Xqo))) 

as k —> oo. Then taking G{x co , xo, 77(x max , xo, •)) of both sides we have 

^o(a?oo) — ^(xoo, XO) 77(x maxj xo, rrto(x max ))) 

> ^(Xoo) 

or in other words, Xqo G So- However, since X& satisfies (7.22) with r = r k , taking k —>• oo 
implies l(px 0 , 2 0 (xoo)) > l(pmax)- Now by (7.24) we have ps 0 , 20 (^ 00 ) 7 ^ Pmax, which contradicts 
the uniqueness of p max as the maximizer of Z(-) over [So]x 0 ,z 0 . □ 


7.5. Proof of Theorem 2.3. Suppose the theorem fails and the contact set So contains more 
than one point. Since ffo is compactly contained in fl, we take e such that 

0<e<d([n 0 k,*d.[«]fo,*o)- ( 7 - 25 ) 

Next take % G T* Q M obtained from applying Lemma 7.6, and r e > 0 associated to our choice 
of e by Corollary 7.8. Then by (7.18) there exists a set 4 C fio such that 

[d G u (H)] 3 . 0iU(3 . o) = I Po (q 0 ,r € ) n ^B re (p 0 ) \ Hix(po)) , 

while by Corollary 7.8 and (7.25) we must have 

A c So n (b cl \ u 0 ) • 


Then since u is an Aleksandrov solution (and also recalling (4.10)), for some constant C > 0 
depending on [u n ,un] we have 


0< Ip 0 {qo,r e ) C (B re (p 0 ) \ B*± 

< c|An n 0 \ c 
<c\s o \ c = 0, 


c 


thus finishing the proof by contradiction. 

□ 
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8. Engulfing property and Holder regularity for the gradient 

8.1. Engulfing property of sections. In this section we shall prove Theorem 2.4. We follow 
here a method first introduced by Forzani and Maldonado to obtain explicit C l,a bounds for 
the real Monge-Ampere equation [FM04]. This was later adapted by Figalli, Kim and McCann 
to prove C l,a regularity of the potential in optimal transport [FKM13a, Section 9]. 

For x £ fig* x £ Oqu ( x), and h > 0, we will use the notation 


S(x, x, li) 

= {y £ H | u{y) < m h (y)} , 

(8.1) 

m h (-) 

= G{-,x,z h ), 

(8.2) 

Zh 

= H(x, x, u(x) + h). 

(8.3) 


We comment here that since u is assumed very nice, for any h > 0 sufficiently small we will 
have mh £ [Unj^n] on all of f 1. Additionally, Theorem 2.3 implies we may assume the section 
S(x,x,h ) is contained in a ball of arbitrarily small diameter, entirely contained in Qfj 1 *; also 
condition ( 2 . 2 ) will hold on any subset of S(x, x, h ). Asa result we can apply Theorem 2.1 to the 
sections S(x, x, h ) as long as h is small, and Theorem 2.2 to any AcS(i, x, h) satisfying (2.1). 
Furthermore, as u is an Aleksandrov solution this implies that for any subset A C S(x,x,h) 
we will always have \A\^ ~ \dou{A)\^. We point out here that the strict G-convexity of 
u, Theorem 2.3, is essential here, as it allows us to actually apply our Aleksandrov estimate 
Theorem 2.1 to all sections with small enough height. 

Since later, we will be concerned with a dilation of the section S(xo,xo,h ) with respect to 
Pxo,z h ( x o) (instead of the center of mass of the section), we begin with a preliminary result 
showing that Px 0 ,z h ( x o) is actually fairly close to the center of mass. 

Proposition 8.1. There exists a very nice 7 £ (0,1) and ho > 0 such that for any h £ (0, ho), 

Px 0 ,z h (xo) G l[S(x 0 ,x 0 ,h)\x o ,z h , 

where the dilation above is with respect to the center of mass of [S(xo,xo,h)]x 0tZh - 

Proof. We will write Sh '■= S(x 0 , xo, h) for the duration of this proof. 

Let us define 

t 0 ■■= inf {t £ [0,1] | Px 0 ,z h (x 0 ) £ t[S h ]x 0 , Zh } , 

our goal is then to prove to < 7 < 1 for some very nice 7 ; note we may assume, say, to > 
^ otherwise we are already done. Then by combining Theorem 2.1 and the main result of 
[FKM13b], (and recalling that u is an Aleksandrov solution) we obtain a very nice Co > 0 such 
that 

(m h (x 0 ) - M(x 0 )) n < c 0 | S h \ 2 c (1 - t 0 )5 S=T . (8.4) 

On the other hand, (as we have done many times) by the mean value theorem applied in the 
scalar parameter, if h < ho with some small, very nice h 0 > 0 then for some very nice C we 
can see that 

m h (x) = G(x,xo,H(xo,xo,u(x 0 ) + h)) 

< G(x, xo, H{xo,xq, u(xq))) + Ch = mo(x) + Ch, 

here the fact that u is very nice allows us to assume C is also very nice. Since mo < u 
everywhere, the above leads to 

sup (mh — u) < Ch = C(mh(x 0 ) — u(x 0 )). 

Sh 
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By shrinking ho further if necessary, with a very nice dependance, and using that u is very 
nice , we see mh and A = exp!? ((KM)~ 1 [Sh]x 0 ,z h ) will satisfy condition ( 2 . 2 ). Thus applying 
Theorem 2.2 and combining with the above inequality yields another very nice ci > 0 such that 

(m h (x 0 ) - u(x 0 )) n > ci \S h \c ■ 


Combining the above inequality with (8.4), it follows that 


< l ~t 0 


to < 1 - ( - 
CO 


( \ 2n_1 

Thus the proposition is proven with the choice 7 := 1 — ( ^ ) < 1, which is also seen to be 

very nice. □ 


The next lemma proves a rather strong property of a solution u: sections of different heights 
are roughly homothetic to one another (in cotangent coordinates). The lemma uses in a crucial 
way the strict G-convexity of u, which guarantees that S(x, x, h) is contained in a neighborhood 
of h when h is small enough (as discussed at the beginning of the section). 

Lemma 8.2. There exists a very nice constant P £ (0,1) such that 

[S(x, x, h)\x,z 2h C P[S(x, x, 2 h)]x,z 2h , V h £ (0, ho). 

Here the dilation P[S(x,x,2h)]x lZ2h is with respect to Ps,z 2h {x o)- 

Proof. Clearly it suffices to show that if Px,z 2h {y) £ [S(x, x,2h)\x tZ2h \ (P[S(x,x,2h)]x, Z2h ) then 
y 0 S(x,x,h). 

We now work toward this claim, assume we have such a y. At the end of the proof, we 
will wish to take /3 close to 1 , hence there is no harm in assuming from the start that P > 

In particular, by Proposition 8.1 above we can see that Px 0 ,z 2h {y) is outside of some very nice 
dilate of [5(x,x, 2 h)]x,z 2h with respect to its center of mass. Thus we can once again combine 
Theorem 2.1 and the main result of [FKM13b] to obtain a very nice constant C such that 

m- 2 h{y) - u{y) < C{ 1 - P)^ 1 \S(x,x,2h)\ 2 / n . 

On the other hand, as in the proof of Proposition 8.1 above, for h < ho we may apply Theo¬ 
rem 2.2 to obtain \S(x,x,2h)\ c < C(2h) n / 2 , thus combined with the above yields 

m 2 h{y) ~ u(y) < C{ 1 - P)^ T h. 

Rearranging, we have 

u(y) > m 2h {y) ~ C{ 1 - P)^ T h 

= G(y, x,H(x, x, u(x) + 2 h)) — 2C(1 — P) ^ n ~ 1 h 
> G(y , x,H(x, x, u(x) + h)) + coh — 2C'(1 — P) 2 ’ 1 - 1 h 
= rn h (y ) + (c 0 - 2 C (1 - P)^= T )h 

where again, possibly shrinking ho in a very nice manner, again the mean value theorem yields 
the third line above, with a very nice constant cq- Finally by taking 1 — P small enough we 
have u(y) > mh{y), it then follows that y fL S(x,x,h). □ 
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The lemma above says that sections are comparable under affine rescaling (in the right 
system of cotangent coordinates). The fact that sections are comparable in this manner is a 
very strong assertion for a G-convex function, and it will imply explicit G 1,a estimates (as well 
as G-convexity estimates). 

The next lemma is an analogue of the “engulfing property” (see [FM04, Theorem 4] and 
[FKM13a, Theorem 9.3] for the Euclidean case and the optimal transport case, respectively). 

Lemma 8.3. There are constants A > 1, to > 0 with the following property: if xq,x± G Ho 
and xo,xi G Qq are such that x\ G S(xq, xo, h ) for some h < ho, then 

xo G S(xi, x\, Ah) 

Proof. Let xf G S(xq, xq, 2h) d be such that the G-segment x(s) := \xq,x^\- contains the 
point x\ = x(si). Then Lemma 8.2 implies that si < (3. We now claim that for some very nice 
G > 0, 

G(xi,xi,H(xo,xi,u(xo) + 2 h)) < m 2h (x i) + Ch. 

Indeed, if G{x\,x\,H(xq,x\,u{xo) + 2 h)) < m 2 h{xi) we are already done. Otherwise we note 
that since u is very nice , we may apply (G-QQConv) with s = s' = s\, zq := z 2 h , and xf in 
place of x\ there to conclude 

G(xi,xi,H(xo,xi,u(x 0 ) + 2 h)) - m 2h (x i) 

< ■^ 1 (G{xl,xi, H(x 1 ,x 1 ,m 2 h{x 1 ))) - m 2h (xf^j 

< (G(xf,xi,H(xi,xi,m 2h (x 1 ))) - m 2h (xfj^ . 

Then since m 2 h(xi) — u{x\) < 2 h, for small enough h we can again use the mean value theorem 
to find a very nice C such that 

G(xf, xi, H (xi, xi, m 2 h(xi))) - m 2h (xf) 

= G(xf,xi,H(xi,xi,u(xi))) - m 2h (x f) + C(m 2h (x i) - u(x i)) 

< u{xf) — m 2 h(xf) + Ch < Ch, 

proving the claim. 

Using this claim, the mean value theorem again, and that xq G Oqu (xo) we have a very nice 
A > 0 such that 

G(xi,xi,H(xo,xi,u(xo) + 2 h)) < G{x\,xo,H(xo,xq,u{xo) + 2 h)) +Ch 

< G(x i,x 0 , H(xq,xq,u(xo))) + Ah 

< u{x i) + Ah. 

Finally, applying G(xo,xi,H(x\,x\,-)) to both sides of the above inequality and using the 
monotonicity of this function in the scalar variable, we obtain 

u(x o) < u(x o) + 2 h 

= G(x 0 ,xi,H(xi,x 1 ,G(xi,x 1 ,H(x 0 , xi,u(x 0 ) + 2 h)))) 

< G(xq,xi,H(xi,xi,u(xi) + Ah)), 

(as long as h is sufficiently small in a very nice manner, the expression u{x i) + Ah is indeed 
contained in the domain of G{xq, x\, H[x\, x\, ■))). This proves xq G S(xi, x±, Ah). □ 
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Lemma 8.4. Let A be as in Lemma 8.3. There exist very nice Ai > 0 and do > 0 such that if 
d g (x o,xi) < do , xo G &gu(x o ), and x\ G &gu(x i ), then 

~^-{u(xi) - G(x i, x 0 , H(x 0 , x 0 , u(x 0 )))) < u(x 0 ) - G(x 0 , x 1 ,II(x 1 ,x 1 ,u(x 1 ))). (8.5) 

Ai 

Proof. For e > 0 let us write 

t € := G(xi,xi,H(xq,xi,u(xo) + e)) - u(x i) > 0. 

We will eventually take e \ 0, so we may assume it is as small as we want, additionally 
if we assume that d g (xo,xi) < do for some do > 0 depending on the Lipschitz norm of u 
(which in turn, is controlled by the constant Kq in (Lip^ ), recall Remark 4.19), we will have 
u(x 0 ) + A t € G [u N , un] ■ Then 

G(xo,xi,H(xi,xi,u(xi) + r e )) = G(x 0 ,xi, H(xi,x\,G(xi,xi, H(xq,xi,u(x 0 ) + e)))) 

= u(xo) + € 

hence xo G S(xi,xi,r e ), thus by Lemma 8.3 we must have x\ G S(xq, xo, Ar e ). This means by 
the mean value theorem, for a very nice C > 0 we have 


u(xi) < G(xi,x 0 , H(xo,x 0 ,u(x 0 ) + Ar e )) < G(xi,xo,H(x 0 ,xo,u(x 0 )) + CAr e , 
or rearranging and taking e to 0, 


1 

CA 


(u(x i) - G(xi,x 0 , H(x 0 ,x 0 ,u(xo))) < G(xi,xi,H(x 0 ,xi,u(x 0 ))) 


u(x i). 


Now by using the mean value theorem again, we calculate 
G(xi, xi, H(x 0 , xi, u(x 0 ))) - u(x i) 

= G(xi,xi,H(x 0 ,xi,u(x 0 ))) - G(xi,xi,H(xi,xi,u(xi))) 

= G(xi,xi,H(x 0 , xi, u(x 0 ))) - G(xi,xi,H(x 0 , x 1 , G(x 0 , x lf H(x 1; x lf u(x 1 ))))) 
< C(u(x 0 )) - G(x 0 ,x 1 ,H(x 1 ,x 1 ,u(x 1 )))). 


The constant C in the final inequality above can seen to be very nice for the following reason: by 
possibly shrinking do depending on Kq in (Lip Kq ), we can ensure that G(x o, xi, H(x i, x±, u(x\))) 
is sufficiently close to u(x o), since u is very nice we can ensure C is also very nice. Combining 
these above two inequalities yields (8.5) for Ai = CA. □ 


8.2. Proof of Theorem 2.4. Fix a point xq G flQ nt and an xq G Oqu (xo). We shall now show 
that for some very nice C > 0, 

u(x) - G(x,x 0 ,z 0 ) < a _ p d g (x,x 0 ) 1+/3 

for all x in a small neighborhood of xq, where f3 < a and (3 < A)) 1 , where Ai is the constant 
in (8.4). As G(-,x,z) is uniformly C 1,a in x, the C regularity of u follows by a standard 
argument. 

Fix an x\ with d g (x o,xi) < do and let x g (s) be the (unique) unit speed geodesic from xo 
to x\. The engulfing property, used via Lemma 8.4, will lead us to a differential inequality for 
<f>(s), where 

4>(s) := u(x g (s)) - G(x g (s),xo,zo). 

To make the idea of the proof clear, let us go over it first in the special case where u G C 1 . 
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Proof when u is C 1 . Let us define 

x s := d G u(x g (s )), 

:= H(xg(s),x s ,u(x g (s)))-, 

note as u is C 1 , the set d G u (x g (s)) is actually single valued for each s. Differentiating cj) (u is 
C 1 , and the chain rule applies) and using x s , z s as defined above, 

(j)'(s) = (Du(x g (s )) - DG(xg(s),x 0 ,z 0 ),Xg(s)) 

= (DG(x g (s),x s , z s ) - DG{x g (s),XQ, zq)),x 9 {s)). 

Now, with a C given by the C' 1, “-norm of G with respect to the first variable (and uniformly 
in the other two), 

G(x g (s),x 0 ,z 0 ) < G(xo, xo, zo) + s(DG(x g (s), x 0 , z 0 ), x g (s)} + Cs 1+a 
G(x g (s),x s , z s ) > G(x 0 , x s , z s ) + s(DG(xg(s),x s , z s ),x g (s)) - Cs 1+a . 

Thus 


s(DG(x g (s), x s , z s ) - DG(x g (s),x 0 , z 0 ),x g (s)) + 2 Cs 1+a 
> G(x g (s),x s , Zs) - G(x g (s),x o, zo) - G(x 0 , x 3 , z s ) + G{x 0 , x 0 , z 0 ) 
= u(x g (s)) - G(x g (s),x 0 , zo) - G(x 0 , x s , z s ) + u(x 0 ) 

= 4>(s) - G(x 0 ,x s , z s ) + u(x o). 


On the other hand, by Lemma 8.4, 

u(x 0 ) - G(x 0 , x s , z s ) > A^ 1 (u(x g (s)) - G(x g (s),x 0 , z 0 )) 

= a rV(s), 

thus by combining the above and rearranging, we have 

s4>'(s) — (1 + A)“ 1 )(()(s) + Cs 1+a > 0. 


Using the elementary identity, 

d_ 

ds 


(j)(s) 


S l+P j ~ -j2+0 “ 0 + W( S )) 


with any choice of f3 such that (3 < a and (3 < A 1 1 yields 


d_ 

ds 


Qi+/3 


+ Cs *- 0 - 1 > 0 . 


In particular, for any s < d g (x o,*i), by integration we obtain 

(f){d g (x 0 ,xi)) + Cdg(xo,xi) a ~ p > 4>(s) + Cs a ~ 13 > cj)(s) 


a — (3 


s i+p 


a - /3 s 1+ ^' 


( 8 . 6 ) 


d g {xo,x i) 1+/3 

At the same time, (f> is bounded by a very nice constant due to the fact that u is very nice. 
Thus rearranging (8.6) we have for some very nice C > 0 that 


cj>(s) < C 


1 


+ 


1 


T+/3 


a ~P dg(x 0 ,x l) 


1+/3 


< 


Cs 1+ P 


(a - /3)d g (x 0 ,x i) 


1+/3- 
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in terms of u this says that if x lies on the geodesic connecting xo to xi, then 

C 


0 < u{x) — G{x, xq, zq) < 


id g (x,x 0 ) 1+/3 . 


(a - /3)d g (x 0 ,x i) 1+/3 9 

By considering x\ in a small geodesic sphere centered at xq, we obtain the C l 'd estimate for u. 


Proof for arbitrary u. Let us define 

U(p) : = ^(exp s 0 ,zo(p)) ~ G ( ex Ps 0 ,zo(P)’ S o^o) 

Pg(s) : =Px 0 ,zo(x g (s)). 


Then p g : M —> T£ Q M is C 1 on (0,1), and U : T£ Q M -» M is Lipschitz, and since = U(p g (s)), 
one of the chain rules for the Clarke subdifferential [Cla90, Theorem 2.3.10] combined with 
[Cla90, Proposition 3.3.4 and Corollary] yields for all s 6 (0,1), 

d c c/)(s) C con v {{q s , Dp g (s)) \ q s G d c U(p g (s))} . (8.7) 


Again by [Cla90, Theorem 2.5.1] combined with the representation dU(p) = {linifc^oo DU(pk) \ Pk —> 
(proven as in Corollary 4.24), we see that dU(p g (s)) = O c U(p g (s)) for all s. A tedious but 
routine calculation now yields that 

q s € du(p g (s )) <=^ q s = (D pg (s) exp!? )2o (• ))* (p s ~ DG(x g (s),x 0 , z 0 )) 

for some p s G du(x(s)). Here * is the transpose map, defined by duality using the evaluation 
map by 


( iPpg 0)exp^o,z 0 (')) «»*,«) = K, (D Pg{s) ewg, Z0 (-)) v), V u;* € v € T pg{s) T* So M. 

Recalling that Px 0 ,z 0 ( ■) is the inverse of exp!? , (•) we can rewrite (8.7) as 


d c (j)(s ) C conv { (p s - DG{x g (s),xo,zo),x{s)) \ p s G du(x g (s))} . 


Now for each s G (0,1) any p s G du(x g (s)), and any choice of x s G Oqu (x g (s)), a similar 
argument as the case when u is assumed C 1 yields 


s(p s - DG(x g (s),x 0 , z 0 ), x(s)} = s(DG(x g (s),x s , z s ) - DG(x g (s),x 0 , zq), x(s)) 

> (1 + A]f 1 )(u(x 9 (s)) - G(x g (s), x 0 , 2 0 )) - Cs 1+a 
= (l + A- 1 1 )^s)-Cs 1+a 1 


thus it follows that for s G (0,1), 

d c 4>(s) C (s _1 (l + A]f 1 )^(s) — Cs a , oo) . 

Finally, for those s at which 4> is differentiable [Cla90, Proposition 2.2.2] gives cj)'(s ) G d c <ft(s). 
hence at such s we have 


s</)'(s) > (1 + A x 1 )</>(s) — Cs 1+a . 

Since <f> is Lipschitz, the above inequality holds for a.e. s G [0, d g (xo, xi)], from here on the 
proof follows by integration, arguing as in the case where u is C l . 

□ 
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9. The analogue of the MTW tensor, and its relation to 
(G-QQConv)-(G*-QQConv) 

In [Trul4b], Trudinger defines the condition (G3w) below. The condition reduces to the 
Ma-Trudinger-Wang ((MTW) or sometimes (A3w)) condition on the cost function c from the 
theory of optimal transport (the case G(x,x,z) = —c(x,x) — z, see Section 3.3), which is 
central in questions of regularity. The (MTW) condition and certain stronger variations were 
used in [MTW05, LTW10] to prove local and in [TW09] to prove global a priori C 2 estimates 
of solutions to optimal transport problems, leading to C 2,a regularity. In [FKM13a], it is shown 
under (MTW) that pointwise estimates of Aleksandrov type hold, which can be used to show 
C 1,a regularity of solutions to optimal transport (see also [Liu09]). In a previous paper [GK15], 
we introduce conditions called (QQConv), which can be used as starting points to again prove 
Aleksandrov type estimates in optimal transport, but with lower (G 3 ) regularity of the cost 
function. In [GK15] we also show the (MTW) condition implies (QQConv); the conditions 
(G-QQConv) and (G*-QQConv) we introduce in this paper reduce to (QQConv) in the optimal 
transport case. It is also shown by Loeper in [Loe09] that the (MTW) condition leads to certain 
geometric consequences, and when the cost function is G 4 , it is necessary to obtain regularity 
of solutions to optimal transport. Trudinger uses (G3w) in [Trul4b] to obtain a priori C 2 
estimates for G-convex solutions of the generated Jacobian equations (GJE). 

The goal of this section is to demonstrate that conditions (G-QQConv) and (G*-QQConv) 
are reasonable: in the case of smoother generating function G, the conditions follow from 
Trudinger’s regularity condition (G3w) below. 

In this section, we assume that G is G 4 , in the sense indicated in Theorem 2.5 (i.e., all 
derivatives up to order 4, where at most two derivatives fall on any single variable at once, are 
continuous). We first define Trudinger’s condition (G3w). 

Definition 9.1. Fix x G fl, (p,u) G {(DG,G)(x,x,z) \ (x,x,z) G g}, and a local coordinate 
system near x (denoted by x l ) on M; and define 

A i:j (x,p,u ) := G x i xj (x,exp^ u (p),Z^(p,u)) 

where subscripts refer to coordinate derivatives. 

We say G satisfies (G3w) if for any such triple ( x,p,u ), and any V G T X M and r/ G T*M 
satisfying ( 77 , V) = 0, we have 

T (x,p,u) (V, 7 h V, V) := Dp kPl Aij(x,p, u)V l V 3 ri k r)i > 0, (G3w) 

here pi denote the coordinates induced by x l on the cotangent bundle T*M , and D PkPi are 
second derivatives with respect to these coordinates. Likewise, we say G satisfies (G3s) if there 
is some Co > 0 such that for any triple ( x,p,u ) and any V G T X M and p G T*M satisfying 
( 77 , V) = 0, we have 

T (x,p,u) (V,V,V,V) := Dp kPl A ij (x,p,u)V l V J r] k Vl > co\r]\ 2 g \V\ 2 g . (G3s) 

Similarly, for fixed x G Cl, z G M, and p G j -^-(x,x,z) \ ( x,x,z ) G flj, define 

(x, X, z) + D z ( X , X, z)pi 

= H s k s i(exp? z (p),x, G(expg Jg (p),x, z)). 


A* k i(p,x,z ) : = - 


D~k 


D x iG 

~g7 


x=exp§ z (p) 
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Then we say G satisfies (G3*w) if for any such triple (p, x, z), and any V E T X M and r/ E TfM 
satisfying ( fj, V) = 0, we have 

T%,*, 2 ) {fj, fj, V, V) := D 2 pip .A* kl (p, x, z^fj^V 1 > 0. (G3*w) 


Remark 9.2. We recall here that for any fixed ( x,p,u ), the expression in the definition of 
T( x ,p,u) (■,-,■>■) i s a (2> 2)-tensor over T X M x T X M x T * M x T*M; hence actually independent 
of choice of coordinate systems. Indeed, fix (xo,po,uo)- We take two coordinate systems x l and 
y l near xq on M; x l and y l induce coordinates on T*M locally near (xo,po), we denote these 
by ( x l ,pi ) and ( y l ,qi ); the relation being qj = Pkjf^r- Then if (x,p) = (y,q) are coordinate 
representations of the same point in T*M, 


Aij(x,p , u) 


G yay p (y, exp® u (p), Z%(p, u)) 
G yayP (y, exp® u (p), (p, u)) 
Gycyp (y, exp® u (p), (p, u)) 


dy a dyP 
dx l dxi 
dy° dyP 
dx l dxi 
dy a dyP 
dx l dxi 


d 2 v a 

+ G ya (y, exp % u (p), Z°(p, u))g-g~ 

_ _ f) 2 ?/ q 

+ G x0 (x, exp^ (p), Z, G (p,«)) ^ 

^ dy a dx l dxi 


Since the second term in the last expression above is linear in the pi coordinates, it will vanish 
under two differentiations in those variables. Hence we obtain 


Dp k piAij{x } p, u) = D PkPl Gy ay p(y,exp^ u (p),Z^(p,u)) 


dy a dyP 
dx l dxi 


= G)q r q s G y a y p {y, exp G u (g), Z G (q,u)) 


dy a dy 13 dx k dx l 
dx l dxi dy r dy s 


and thus the expression giving T( Xj p jU ) transforms according to the transformation law 

for (2, 2)-tensors as claimed. A similar argument holds for T * t Pt x,z) ('> ■> '■> ■)■ 


There are numerous known cases in optimal transport when (G3w) holds: see [Loell, KM12, 
LV10, FR09, DG10, FRV12] (examples arising in Riemannian geometry) and [MTW05, TW09, 
LM11, LL12] (more general cost functions). Some cases that do not fit into the optimal transport 
framework that satisfy (G3w) are demonstrated in [JT14, Section 4] (near-field parallel beam 
reflection and near-field refraction) and [Trul4b] (near-field point source reflector with flat 
target). One interesting future direction is to explore how in the near-held point source reflector 
problem with a more general target, the conditions for regularity presented in [KW10] fit within 
the framework of generated Jacobian equations. 

We now devote the remainder of this section toward proving Theorem 2.5. In order to do 
so, we first obtain (G*-QQConv) by adapting the strategy in [GK15, Lemma 2.23], which is 
originally motivated by a computation in [KM10, Proposition 4.6] stemming from the optimal 
transport case. The major difference is, of course that the added nonlinear dependency of 
G on the scalar variable z highly complicates matters. Additionally, due to the restriction 
of the domains g and f), we must be extremely careful in our computations to ensure that 
the quantities involved are always well-defined; this is another subtlety not present in the 
optimal transport case. Finally, since the “source” and “target” domains hi and do not 
play an exactly symmetric role, we must carefully exploit the duality relation between G and 
H to obtain (G-QQConv) from (G*-QQConv). Henceforth we will fix a compact subinterval 
[mq,«q] C (u,u). 
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We comment here that in the lemma below, we will actually make use of (G3*w) instead of 
(G3w); by [Trul4b, Theorem 3.1] it is known that if G satisfies (G3w), then it also satisfies 
(G3*w), and vice versa. 

Lemma 9.3. Suppose G, fl, and ft satisfy the hypotheses of Theorem 2.5 and let uq G [uq, uq], 
xq, x\ G ft cl , x 0 , x\ G ft cl , and x(f) := [xo,xi\ XQ Uq - Also define 

/(f) := G(xi,x(t),H(x 0 ,x(t),u 0 )), t G [0,1]. 

Then, if (xi, x(t), H(xo, x(f), uq)) G 0 for all f G [0,1], there is a constant C > 0 depending on 
['U q . uq], various derivatives of G, ft, and O, but independent of xo, xi, xo, and x\ such that 

f"(t)>-C\f\t)\, Vi G (0,1). 

Proof. First by (DomConv*) and (Unif), note that x(t) is well-defined and contained in f2 cl , 
and in particular 

(x 0 ,x(to),H(xo,x(t),u 0 )) G 0 , Vf G [0,1]. 

For s, t, to G [0,1] we introduce the function 

G(x(s-,t 0 ),x(t),z(t)) 

9[ 0j G z (x(s-t 0 ),x(t 0 ),z(t 0 )y 

where 

Po : = DG(x 0 ,x 0 ,H(x 0 ,x 0 ,u 0 )), pi := DG(x 0 ,x 1 ,H(x 0 ,x 1 ,u 0 )), 

z(t) : = H(x 0 ,x(t),u 0 ) = Z^(( 1 - t)p 0 + tp 1 ,u 0 ), 


DG DG 

Po(to) : = - — (x 0 ,x(t 0 ),z(t 0 )), PI(t 0 ) := ~ — (xi,x(to),z(to)), 

Z 

x(s-,to ) : = exp® (to)|Jg(to) ((l - s)p 0 (t 0 ) + spi(t 0 )). 

Since (xi, x(to), z{to)) G 0 for all to G [0,1] by assumption, (DomConv) ensures that x(s;to) = 
[xo, X i]x(t 0 ) z(t 0 ) w ell-defined and in particular, 

(x(s; t 0 ), x(t 0 ), z(f 0 )) G 0 (9.1) 

for all s, to G [0,1]. Also by (G*-Twist) and the definitions of po(to), Pi (to), and x(s;fo), we 
must have 


®(0; to) = xq, x(l;t 0 ) = xi 


independent of to G [0,1], thus 

fit) = -G z (xi, x(t 0 ), z(t 0 ))g(t, 1;t 0 ), V t, t 0 G [0,1], 


We shall now show that for some constant C > 0, which will be the same one as in the statement 
of this lemma, 


(P_ 

W 


■p(t,s;t 0 ) 


+ s 2 C\(p 1 {t 0 ) -po(t 0 ),x(t 0 ))| 

t=t 0 


(9.2) 
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is a convex function of s G [0,1] which vanishes to first order at s = 0; hence it is nonnegative 
for all s G [0,1]. Taking s = 1 we then obtain 


f"(t 0 ) 




eP_ 

dt 2 


g(t, 1; t 0 ) 


t=to 


> CG z (xi,x(to), ^(to))|(pi(to) -Po{to),x(t Q ))\, 


Vt 0 e [0,1]. 


Further, we shall see by (9.3) below with s = 1, 

\f'{t 0 )\ = -G z (x!, x(t 0 ), z(t 0 ))\{pi(t 0 ) ~ Po(to),x(t 0 )}\, V t 0 G [0,1], 

since G z < 0; thus the lemma will be proved once we check the claims made on (9.2); this we 
do in four steps. 


Step 1. First we show that for every to G [0,1] we have 

d 

—g(t,s-,t 0 )\t=t 0 = s(pi{t 0 ) - po(t 0 ),x(t 0 )). 

Indeed by (4.3), 


(9.3) 


d_ 

dt 


g(t,s;t 0 ) 


(DG(x(s; t 0 ),x(t), z(t)),5(t)) + G z (x(s ; t 0 ), x(t), z(t))z(t) 

G z (x(s;t 0 ), x(t 0 ), z(t 0 )) 

G z (x(s: to), x(t), z(t)) , DG, , . DG, . ... 

^ , , t \ — /. \ TTWx (—(a;*o),a + —(x 0 ,x(t),z(t)),x(t)), 
G z (x(s\to),x(to),z(t 0 )) G z G z 


(9.4) 


thus taking t = to we have (9.3). 


Step 2. Now, we make a series of calculations in the flavor of [KM10, Proposition 4.6]; for our 
second step we show the zeroth and first order parts of the expression (9.2) are zero at s = 0. 
First by further differentiating (9.4) in t and taking t = to, we see 


d 2 

"hi2^(t> ^o) 

t=t 0 

= s(pi{t 0 ) - Po(to),£{to)) 

s{pi(to) -po(to),x(to)) d 


+ 


G z (x(s;t 0 ),x(t 0 ),z(t 0 )) dt 


G z (x(s;t 0 ),x(t),z(t)) 


t=to 


+ ^(-^(s(a; to), x(t), «(t)), ®(t 0 )) 
= / + // + /// + IF. 


t=t Q 


+ ^{^{xo,x(t),z(t)),x(t 0 )) 


t=to 


(9.5) 


Since x(0;to) = xq, we immediately see that for any t, to G [0,1], taking s = 0 in the above 
expression, 


dt 2 


g{t,s;t 0 ) 


= 0 . 


s=0,t=to 
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On the other hand, 


d . . 

—5r(t, s; to) 


(DG(xq, x(t), z(t)),x( 0; t 0 )) 


s= o G z (x 0 ,x(t 0 ),z(t 0 )) 

((1 - t)p 0 + tpi,x(0;t 0 )) 


- p(t 0 )G(x 0 ,x(t),z(t)) 


- p(t 0 )u 0 


G z (x 0 ,x(to),z(t 0 )) 

where p(to) is some expression independent of t, thus differentiating the above twice in t we see 

d 3 


dsdt 2 


g{t,s-to) 


= 0 


S=0,t=tg 


for all t, to G [ 0 , 1 ]. 


Step 3. Next note that since G satisfies (G3w), by [Trul4b, Theorem 3.1] it satisfies (G3*w). 
Then as T* (•, •, •) is a (2, 2)-tensor by Remark 9.2, (G3*w) (and recalling (9.1)) implies there 
exists a constant C > 0 depending only on 12, Cl, [uq.uq], and derivatives of G and H up to 
order 4 (independent of s and to) such that, 

-7 s;to ’ — ^ ((1— s)po(to)+spi(to),x(to),z(to)) (Pl(^o) — Po(^o) > Pi (to) ~ Po(to), x(to), x(to)) 

> -C\(pi(t 0 ) ~ Po{t 0 ),X(to))\. (9.6) 

Step 4- Finally, we work toward showing the convexity of (9.2) in the s variable. Toward this 


end, we first claim that 

d 4 


ds 2 dt 2 


g(t,s-,t 0 ) 


t=t 0 


_nn* 

~ 1 s-,t 0 


+ (pi(to) ~ Po(to),x(to))-Q-2 


d 2 I' s(G zz [(2 - s)po(t 0 ) + spi(t 0 )] + 2DG z ,x(t 0 )) \ 


(9.7) 


where the arguments of DG Z , G zz , and G z are (x(s',to),x(to),z(to)). Since / + IV in (9.5) are 
affine in the variable s, after taking two derivatives in s the terms vanish and do not appear 
in the expression (9.7). On the other hand (hereafter, derivatives of G are to be evaluated at 
(x(s]to),x(to),z(to))] we suppress this notation for brevity), 

III = -(D Hto) + *(*o), x(t 0 )) 

= ~(D ( 7 S) *(to)J(to» - J(to))(po(to)J(to)) 


d_ 

at 


G z (x(s-to),x(t),z(t)) 


t=t 0 


(DG Z , x(to)) + G zz z(to) 
(DG z ,x(t 0 )) + G zz {p 0 (to),x(to)}, 


by (4.3), while 
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thus 


II + III = —(D 


DG 

G z 


x(t 0 ),x(t 0 )) - ( 


IDG 

Vg7 


,x{t 0 ))(p 0 (t 0 ),x(t 0 )) 


+ 


s{pi(t 0 ) -po(to),x(t Q )) 


= -(D 


G z 

DG 


G z 


x(t 0 ),x(t 0 )} - ( 


(( DG Z , x(t 0 )) + G zz (po(t 0 ), x(fo))) 
IDG 




x(t 0 ))({l - s)p 0 (t 0 ) + spi(t 0 ),x(t 0 )) 


2s{DG z ,x(t 0 )) ± 

H-^ -po(to),x(to)} 

vrZ 

+ sG ‘ M2 - ,)w( ‘° ) + ,Mto) '* (to)> {Mto) - 
(gtz 


By comparing to (G3*w) we see differentiating the first two terms above twice in s yields the 
T* t term in (9.7), we obtain the full expression (9.7). 

Now the absolute value of the final factor multiplying (pi(io) — Po{to), x(to)} in (9.7) has 
an upper bound depending on the quantities ||DG Z ||, ||D 2 G 2 ||, ||£)G ZZ ||, ||D 2 G 22 ||, ||DZ)G 2 ||, 
\\D 2 DG z \\, \G z \~\ |i(s;t 0 )|^_ (3 . to) , |®(s;to)| ffe( ,. to) , boMfe (to) , \Pi(to)\g^ to y and |x(io)| 5a(to) ; by 
the compactness of f7 cl and fi cl , all of these quantities have a uniform upper bound depending 
on the interval ['Uq, uq] but independent of xq, x\, xo, and x± (also through (G-Nondeg) via 
(4.1)). Thus combining (9.7) with (9.6) we obtain 


d 2 

ds 2 dt 2 


g(t,s-,t 0 ) 


> -2C\(pi(t 0 ) ~Po{to),x(to))\, 

t=t 0 


Vs £ (0,1) 


for some C > 0 with only dependencies as claimed in the statement of the lemma. This last 
inequality is nothing else but the fact that the auxiliary function (9.2) is indeed convex in s for 
any fixed to- □ 


Corollary 9.4. Let /(f) be as in the previous lemma. Then there exists a M > 1 depending 
on [u,q,uq] but independent of xo, x\, xo, and x\ such that 

m - m < [/(l) - /(*')] + , v * € [0,1], (' 6 [0,1). 

Proof. First, we shall show that 

if 3 t* G [0,1] s.t. > 0, then f(t) >0, V f G (t*, 1], (9.8) 

To see this, let us adapt an argument found at the end of the proof of [Vil09, Theorem 12.46] 

as follows: suppose that (9.8) is false, then f'(t ) < 0 for some t G (t*,l]. In this case, there 

exists to G (t*, 1] which is the first zero of f after t* , that is 

t 0 := inf {f G [t*, 1] | f'(t) = 0} . 

Since f'(t) > 0 for t G (t*, to), Lemma 9.3 gives that ^ log f'(t ) = j^y > —C for any t G (t*, to). 
Integrating this inequality yields 

log /'(f) > log V t G (t*,t 0 ). 

Taking t /* to, we see that log/'(f) remains bounded from below, thus /'(to) > 0, which is a 
contradiction. One key consequence of (9.8) that we exploit is that / cannot have any strict 
local maxima in (0,1). 
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We now return to the main inequality, fixing t E [0,1] and t' E [0,1) we consider two cases, 
according to whether /( 1 ) > fit') or /( 1 ) < fit'). 

First suppose /(1) < We must then have /(0) > fit'), otherwise this will contradict 

(9.8). Now suppose that f(t) > /(0) as otherwise there is nothing to prove, then this easily 
would imply the existence of some strict local maximum of / in ( 0 , 1 ), which is a contradiction. 
Thus we obtain the inequality in this case. 

It remains to consider the main case when /(1) > fit'), to handle this case we follow a refined 
version of the argument in [GK15, Lemma 2.23]. Again assume f(t ) > /(0), and temporarily 
assume > 0. Consider the functions 

fit) : = /(tt), 


then by Cauchy’s mean value theorem, for some t E [0,1] we have 

f(t) - /( 0 ) /(l) - /( 0 ) 

/(I) - fit’) /(1) - /(0) 

fit) 
fit) 


- (l-t')/'(t' + t(l-t'))' 

Since t'+t(l — t') > t! and we have assumed f it!) > 0, (9.8) guarantees that f'(t'+t{l — t')) > 0. 
This and (9.9) in turn imply that f'(tt ) > 0 as well. On the other hand, since t, t' < 1 it is 
clear that 0 < tt < t' + t(l — t') < 1. Thus by (9.8) again we see f > 0 on [tt, t' + t(l — t ')] and 
we can integrate the inequality in Lemma 9.3 over this interval. This yields 

_ ZM _ < e C(t'+t(i-t')-tt) < e c = . M 

fit'+ til-t'))~ 

(note that M > 1). Combined with (9.9) we obtain the desired inequality when fit') > 0. 
Finally suppose that f if ) < 0; by (9.8) we must have fit') < /(0). Let 

t 0 = sup {t" E [0,1] | fit") = fit')} , 


clearly t' < to < 1 and for small enough e > 0 we have fit o + e) > 0. We cannot have t < to, 
as fit o) = fit') < /( 0 ) < /(t) and this would imply the existence of a strict local maximum 
of / on (0,fo)- Thus t E fo, 1] and we redo the calculation leading to (9.9) with the function 
/ e (t) := /(to + e + t(l — to — e)) in place of /, t e := replacing t, and 0 in place of t' to 

obtain 

/(0 - fjto + e) = /e(G - /€(0) < tefjtet) = tj^tp + 6 + tjt - tp - g)) < _ 

/(l)-/(to + e) /e(l) — /e(0) — /'(t) /'(to + e + t(l-to-e)) - 

we are able to obtain the final inequality as f{t o + e) > 0 , hence we may integrate as in the 
previous case of (9.9) for the bound. Finally taking e to 0 and using that 


fit) ~ fi 0) < fit) - fit') = fit) - /(t 0 ), 


lim t e 

e-S>0 


t - to 
1 — to 


< t < 


t 

1 -t' 


we obtain the inequality in this case. 
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□ 

Remark 9.5. Note that (G*-Twist) implies for fixed x, the mapping (x,u) i —> (DH, H)(x,x,u) 
is injective on the set {(x,u) \ (x,x,u) G f}}: suppose for x fixed, and (x±,ui) and (X 2 ,U 2 ) in 
this set we have 


DH(xi,X,Ul) = DH(X2,X,U2), 
H(xi,x,ui) = H(x 2 ,x,u 2 ) =: z. 


Then, m = G(xi,x,z ) and DH(xi,x,G(xi,x, z)) = — ^f(xi,x,z) for i = 1, 2, hence the first 
line above is equivalent to 


DG 

~G7 


(xi ,x,z) 


DG 

~G~ Z 


(x 2 ,x,z). 


Since ( Xi,x,z ) = (xj, x, H(xi, x, Ui)) G 0, by (G*-Twist) we must have x\ = x 2 ■ But then 
clearly also u\ = U2, hence we have injectivity. Moreover, if JJ§(p, z) := G(exp^ z (p), x, z), then 
we have 


DH(exp^ z (p),x,U^(p, z)) =p, 

H(exp? e (p),x,Uj?(p, z)) = z. 

Also note that (G-Twist) implies that for fixed {x,u), the mapping x i->- —j^-(x,x,u) is 
injective on the set {x \ (x,x,u) G f)}: fix x, u and say x\ ^ x 2 with ( x,Xi,u ) G f) (thus 
(x, Xi, H(x, Xi, u)) G 0) for i = 1,2. Now G(x, Xi,H(x, Xi,u )) = u for i = 0, 1, thus by (G-Twist) 
it must be that DG(x, x\, H(x, x\, u)) / DG{x, x 2 , H(x, x 2 , u)). Since —^-(x, Xi, u) = DG(x, Xi, H(x, Xi, u)), 
this gives the claim. Moreover, 

DG(x, exp ^ u (p),H(x, exp % u (p),u)) 

DG(x, exp x, u (P), Z®{p, u)) = p. 

With the above Corollary 9.4 in hand, (G*-QQConv) will be immediate. To obtain (G-QQConv), 
we first show a “dual” version of (G*-QQConv) for the function H ((H*- QQConv) below). We 
then exploit the relation between G and H, along with monotonicity properties in the scalar 
variables to translate this to (G-QQConv). 


DH 

Id 


(x,exp® (p),u) = 


Proof of Theorem 2.5. Corollary 9.4 immediately implies (G*-QQConv). 

To obtain (G-QQConv), first fix a subinterval [«q,wq] C (u,u), and let xo, x\ G 0, x\, 
xo G Q, zq G M with G(x(s),xq,zq) G [uq,uq] for s G [0,1], and x(s) := [xo,xi]^ . x(s) is 
well-defined and remains in Q cI for all s G [0,1] by (Unif) and (DomConv). Keeping in mind 
Remark 9.5, we can follow the proofs of Lemma 9.3 and Corollary 9.4 with the roles of H and 
G, and Cl, and T and T* switched to obtain an analogous version of (G*-QQConv), i.e. there 
exists a constant M > 1 depending on [uq,uq] but not on xq, x\, xq, and x\ such that 

H(x(s),xi,G(x(s),x 0 ,z 0 )) - H(x 0 ,x 1 ,G(x 0 ,x 0 ,zo)) (LP-QQConv) 

< [ H ( x i,xi,G(xi,x 0 ,z 0 )) - H(x(s'),x 1 ,G(x(s'),xo,z 0 ))] + , V s,s' G [0,1). 
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Indeed, we can redo the proof of Lemma 9.3 with the functions 

H(x(s),x{t\s 0 ),u{s)) 

’ S ° ’ H u (x(so),x(t-,so),u(s 0 )) 

g(s) : = -H u (x(s 0 ),xi,u(s 0 ))f(s, 1; s 0 ), 

with 

Po : = DH(xq,xq, G(xq, xq, ~o)), Pi = DH(xi,xq,G(xi,xq,zo)), 
u(s) = - s)po + spi,z 0 ) = G(x(s),x 0 ,z 0 ), 


Po(so) '■ 
x(t;s 0 ) : 


--n-(x(so),x 0 ,u{s 0 )), pi (so) := 
tLu 

ex P5 S o),«(so)(( 1 - Opo(sq) + tpi(s 0 )). 


DH 


(x(so),xi,u(s 0 )), 


Since u(so) E [uq,uq] C («, it) for any so [0,1], (DomConv*) implies that x(t;so) is well- 
defined and remains in 14 cl for all t, so E [0,1]. As before, (using (G3w) in place of (G3*w)) 
we eventually obtain the existence of a constant C > 0 with the correct dependencies for which 
g"(s) > —C\g'(s)\ for all s E [0,1], and the same proof as Corollary 9.4 yields (LP-QQConv). 

Let us write zi(s) := H(x(s),xi,G(x(s),xo,zo)), and fix some s E [0,1], s' E [0,1). Rear¬ 
ranging (LF-QQConv), taking G(x(s),x i, •) of both sides, and using that G z < 0 we obtain 


G(x(s),xi,zi(0)) 

< G(x(s),xi,Z!(s) - ^^7 [~i(l) - zi(s')] + ) 

< G(x(s),xi,H(x(s),xi,G(x(s),x 0 ,zo))) + sup|GL| y~~j, [al( 1) - 2i(s')] + 

= G(x(s),x 0 ,z 0 ) + sup|G , z | i M ^ / [zi(l) - zi(s')] + , 

where here the supremum of |G Z | is over (x,x) E 14 cl x f4 cl and 

«i(s)~ M 1 ) “ 2 i( s ')] + < * < *1 (»)• 

Now since G(x(s), xq, zq) remains in the interval [uq,uq], by using (G-Nondeg) and (4.1), we 
see there is a finite upper bound C\ > 0 depending only on G (through H), 14, 14, and [uq,«q] 
such that 

2i(l) - Zi(s') = J ^-zi(s)ds < Gi(l - s'). 

Thus the supremum of \G Z \ can be taken over the interval [zi(s) — C\ M. ^i(s)], and in turn for 
a C 2 > 0 with the same dependencies as C\ we find 

Ms 

G(x(s),xi,zi(0)) - G(x(s),xo, zo) < C 2 ^ ^ [H(xi, xi, G(x±, x 0 , z 0 )) - zi(s')] + . (9.10) 

First suppose G(xi,x\, zi(s')) < G(xi,xq, zo), we then calculate that 

21 (s') = H(xi,xi,G(xi,xi,zi(s'))) 

> H(xi,xi,G(xi,x 0 ,zo)), 


hence by (9.10) we obtain (G-QQConv) in this case. 
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Otherwise, 

[H(xi,xi,G(xi,x 0 ,z 0 )) - 2 i(s')] + 

= H(xi,xi,G(xi,xq, zo)) - H(x 1 ,x 1 ,G(xi,x 1 ,z 1 (s'))) 

< sup|i^ ?t ||G(xi,xi,2:i(s / )) - G{xi,xq,zq)\ 

= sup|i? M | [G(xi,xi,zi(s')) - G(xi,x 0 ,z 0 )] + 

where the supremum above is over u G [G(x\, xq, zo), G(x\, x\, zi(s 7 ))]. Again, this supremum 
then has a finite upper bound C 3 > 0 depending only on G, fl, f2, and [uq,uq], and by (9.10) 
we obtain (G-QQConv) with the constant max { 1 , C2C3M}. 

□ 
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